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Section I. Introduction 



The primary goal of the studies carried out under this cont‘'act 
was to Investigate the effects of interpolated items on the learning 
and retention of individual stimulus-response (S-R) units in paired- 
associate learning. Specific experiments which are reported in Sec- 
tion II were designed to determine the effects of interference pro- 
duced by other items which occur naturally within the context of a 
standard paired-associate task. The empirical results were useful 
in formulation and evaluation of alternative mathematical models 
for acquisition and retention of verbal information. 

In addition to this primary goal, several investigations were 
carried out with the support provided by the contract in the yenc-^ i 

processing of verbal information. The thread 
wnich ties together the enti’-e research program is the search for 
optpal strategies for the presentation of verbal material.' In a 
series of Investigations reported in Section III, the effects of 
variations in list length (the number of items to be learned s.rul- 
taneously) are studied v/ithin the context of a pair of learning 
models, the first a discrete-state Markov process, and the second 
involving a convolution of incremental acquisition and forgetting 
processes. A related problem is the relative efficiency of various 
part-list procedures. A study is reported in which German-English 
pairs are learned under various part-list procedures, and the data 
are predicted by a two-operator linear model. 



A final series of experiments, reported in Section IV, are 
concerned with short-term memory phenomena in the processing of 
verbal infortnation. In the first study, a factorial comparison 
IS of visual and auditory input and output operations on 
reaction tine in a simple memory task. Both visually and acousti- 
cally con fuspg sets of materials were used, as well as a neutral 
set. An auxilliary study was performed to obtain en empirical con- 
fusion matrix based on visually similar letters. 



^ A final set of studies on short-term memory was carried out 
using pre-school children as subjects. The main variables of 
interest were memory load (number of items to be recalled) and 
load Information (information prior to presentation about the 
number of items to be presented in a serial list) 



Section II: Interpresentation effects in paired-associate learning 

. problem : In a paired-associate task, the sequence 

of presentations of a particular S-R item is imbedded in a rcmnipr 

“"^'^Ised of the presentations of other items in the lisU 
Between successive presentations of a given item from a list of M 
pairs are interpolated from zero to 2(N-1) other items, for each ot 



which the state of learning may range from complete ignorance to 
well-acquiired. If the anticipation technique is used, an additional 
factor is introduced, viz., the amount of time and/or effort which 

the subject expends in trying to come up with the correct response 
for each item. 



The effects of variations in interpresentation events on the 
course of learning a given item are of Interest in part because 
of recent theoretical developments in statistical learning models. 
For example, the trial -dependent forgetting (TDF) model (Calfee & 
Atkpson, 1965) incorporates explicit assumptions about the effects 
of interpolated items on the acquisition of a given pair. Ai»,er 
a study presentation, a pair is assumed to go into a short-term or 
Jong-term memory store. An item in the latter >tate is considered 
to be learned. If the item is in short-term memory, then succeeding 
presentations may result in forgetting, represented as transition 
to a guessing state. Estimates of the transition probabilities 
from several independent sets of data, each of which the model 
handled rather well (Calfee & Atkinson, 1965; Calfee, 1956a), all 
indicated that learning was much more likely to occur if an item 
i^mained in short-term memory since its last presentation than if 
It had been forgotten. One implication is that massing of presen- 
tations should lead to faster learning, a prediction which is not 
in accord with numerous other findings. Greeno (1964), using a 
design in which some items occurred twice in succession on each 
rial, found that the second presentation produced little learning; 
the same number of trials to criterion (twice the number of presen- 
tations) was required for the experimental pairs as for those pre- 
sented once per trial. Rothkopf and Coke (1963, 1966) found in a 
free-recall task that better retention took place if successive 
repetitions of items were spaced rather than massed. 



Greeno (1966) has argued for a theoretical model in which an 
Item must be forgotten to be learned; i.e., if an item is in short- 
term memory Just prior to a study presentation, the study presenta- 
tion will have no effect. An identifiable theory is stated involv- 
ing transition parameters for transfer to long-term memorv from 
either short-term or guessing states. The theory is applied to 
data from Peterson's laboratory (Peterson, Wampler, Kirkpatrick & 
Saltzman, 1963; Peterson, Hi liner & Saltzman, 1962) as well as 
Greeno s. In general, transition from the guessing state is three 

to four times more likely than from the short-term state accord! no 
to Greeno s analysis. ^ 



In the first experiment to be reported in this paper, Greern'' 
twice-per-trial technique is used to ivaluate parametrically the 
emets of massing on learning rate. In a second experiment, a 
different type of mixed-list design is used to determine effects 
of massing presentations, with the proviso that at least one other 
Item intervene between successive presentations of a given pair 
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The third experiment tests another assumption incorporated in the 
TDF model concerning the effects C’F interpolated items# viz., that 
the amount of interference produced by unlearned items is substan- 
tially greater than interference from learned items. 

Method Experiment I 

Subjects . Forty undergraduates enrolled in introductory 
psychology at the University of Wisconsin, volunteered to serve 
as Ss, and were assigned in an alternating manner to the High and 
Low Meaningful ness conditions. 

Materials . Two stimulus sets of 15 CVC tri grams were seiCwter! 
using Archer’s (I960) norms, a high association (HA) set (X=85, 
range 80-90) and a low association (LA) set (X=14, range 10-17). 

To obtain sets with minimal intralist similarity, no 

consonant was used in the first or last positions more than once, 
and the vowels (including Y) were evenly distributed within each 
set. Response terms consisted of two digit numbers; the digits 
1-9 were used with approximately equal frequency, repeating numbers 
were not a^^ov^fed, and the choice was otherwise random. Pairings 
of CVC s with two-digit numbers was randomized separately for each 



Each 15-item list was divided into 5 sublists of approximately 
equal association value. Sublists wore then assigned to one of 
the 5 spacing conditions described below according to a Latin square 
design, which was randomly selected for each block of 5 subjects. 

In condition N, the 3 itens were presented once per trial. In the 
other conditions, each ^ item was presented twice per trial with 0, 

1, 2 or 3 other pairs interpolated between the two presentations 
for the respective conditions. Hence, if the pair HYP-27 was in 
a sublist assigned to Condition 0, then on each trial that pair 
would be presented twice j*n succession with no other pairs inter- 
polated. Between- and w1 thin-trial spacing are necessarily con- 
founded — the more interpolated pairs between the two prese...atto.v;, 
on a given trial, the fewer pairs falling between the second presen- 
tation on trial n and the first presentation on trial n+1. A trial 
consisted of presentation of 27 pairs, however, and the between- 
trial difference in interpolated pairs is proportionately much 
smaHer than wi thin-trial variation. 

An anticipation procedure was used. The stimulus term was 
presented by a Carousel projector, and the S m% given as much 
time as needed to make a response. Response terms were always 
available for reference on a card in front of the S^. Following 
^s response, the correct response was presented together vjith 
the stimulus for a 2-sec. feedback period. 
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Procedure . Following instructions about the paired-associate 
task, S was given a 3-item practice list (A, B and C paired with 
3, 5 and 7) to a criterion of two errorless trials. Any questions 
were answered by appropriate rephrasing of the instructions. The 
expentnental list was then presented with no trial breaks except 
to change slide trays. Within the constraints of the spacing 
conditions, order of presentation of items was random, except that 
an item at the end of one trial was never permitted to be the first 
Item on the next trial. A^response was required on each trial, and 
was encouraged to guess if he didn’t know the answer. Training 
continued to a criterion of three consecutive errorless trials. 

Results^. In^Tab'ie lA is presented the average trial of last 
error to a criterion of 3 consecutive correct responses based only 
00 tn6 first proSGotdtioo por tridl. Ao doalysls of vdriainco 
showed that spacing had a significant effect, F(4,152) = 8.6, 

association level approached significance, F(l,38) = 3.2, 

. 10<«.p-^’^_^.05, and there was no interaction between the two varia- 
bles. The more spaced are the presentations, the more efficient 
IS acquisition, a finding which is even more obvious in Table IB 
where presentations to criterion is presented, rather than trials. 



In Table 2 are presented (A) total errors to criterion based 
on first presentation only, (B) based on second presentation only, 
and (C) summed over both presentations. Analyses of variance on 
these statistics Indicated that effects of spacing conditions were 
significant in all cases, (A) F(4,152)=8.2, p<.01, (B) F(3,114)= 

8. , p^.OOl, (C) F(4,l()2)=8.3, pc. 01. Association level t, oa ’led 
significance for statistic A, F(l,38)=3.0, .10<p^.05. The F-rat;) 
for association level by spacing was less than 1 for all statistic, 
trrors based on the second presentation reflect short-term retentio* . 
There are virtually no errors on the second presentation for zero 
interpolated pairs; as the number of interpolated pairs increases 
errors on the second presentation increase in a negatively acceler- 
a ed fashion. By the time three pairs intervene before the second 
presentation there is a difference of only one error, on the avera'^e, 
between performance on first and second presentations. 



Errors to criterion for conditions 0 and H are virtually iden- 
tical, while considerably more errors were required for the inter- 
mediate spacing conditions. The possibility was considered that 

memory errors, i.e. those errors occurring on the second 
presentation, might be relatively ineffective in the learninq 
process. However, the probability that a criterion run started 
on trial n+1 was equal to .093 in group HA if conditionalized on 
an error on the first presentation on trial n, and equal to .086 
for an e'^rpr on the second presentation; the correspond! nq values 
for group LA are .072 and .082. A criterion run is just abdut as 
likely to follow a second-presentation as a first-presentation error. 
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Table lA 



Mean Trial of Last Error to Criterion 
of Three Successive Correct Responses on 
First Presentation 



Group 



High Association Low Association 



Interpolated Item 
Condition 



0 

1 

2 

3 

H 



4.8 

4.0 

4.1 

3.9 
5.4 



6.0 

5.2 

5.2 

4.9 

6.4 



Table IB 

Total Presentations of Last Error to Criterion 
C 4 . Successive Correct Responses 

Starting with First Presentation on a Trial 



Group 


High Association 


Lpvi/ Association 


Interpolated Item 

Condition 






0 


8.6 


11.0 


1 


7.0 


9.5 


2 


7.1 


9.5 


3 


6.8 


9.1 




5.4 


6.4 
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Table 2 



Nean Total Errors on First, Second or 
Combined Presentations per Trial to 

Successive Correct Responses, 
with Predicted Values from Stimulus Fluctuation (SF) 
and Random-Acquisition-Forgetting model 



Group 


High Association 


Low Association 


Presentati 


on 


1st 


2nd 


lst&2nd 


1st 


2nd 


lst&2t 


Interpolated Item 
















Condition 












1 


0 




Obs. 


4.3 


.4 


4.7 


5,3 


.5 


5.8 




Pred. 


SF 


4.7 


0 


4.7 


5.6 


0 


5.6 






RAF 


4.0 


0 


4.0 


4.9 


0 


4.9 


1 




Obs. 


3.6 


1.7 


5.4 


4.6 


2.2 


6.8 




Pred. 


SF 


3.4 


1.3 


4.7 


4.0 


1 .0 


5.6 






RAF 


4.0 


1.5 


5.5 


4.9 


1.8 


6.7 


2 




Obs. 


3.7 


2.4 


6.1 


4.7 


2.8 


7.5 


1 


Pred. 


SF 


3.0 


1.7 


4.7 


3.6 


2.1 


5.7 






RAF 


4.0 


2.3 


6.3 


4.-9 


2.9 


7.8 


3 




Obs. 


3.5 


2.5 


6.0 


4.3 


3.1 


7.4 




Pred. 


SF 


2.9 


1.8 


4.7 


3.3 


2.4 


5.7 






RAF 


4.0 


2.8 


6.8 


4.9 


3.5 


8.4 


N 




Obs. 


4.7 




4.7 


5.6 




5.6 1 




Pred. 


SF 


4.7 


••• 


4.7 


5.7 




5.7 


1 , 




RAF 


7.2 


«» w 


7.2 


9.3 


■" 1 


9.3 



Note; Parameter values used for predictions for SF model were 
HA:a^«.52, J*.20, LA:a^=.52, d*.16; for RAF model, HA; 
Cvl*“0(J*. 13, f®.44, LA; c(1-0L)5s. 1O, f*.41. 
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Experiment II 



The general conclusion to be drawn from the first experirriert 
would seem to be that spacing of items substantially enhances the 
effect of each feedback presentation. The second experiment was 
designed to evaluate distribution effects in paired-associate 
learning using a between-^s design. It is occasionally the case 
in psychological research that significant effects of a given 
variable depend on within-^ variation, and are not observed when 
varied between Ss, Results based on the first group of Ss run in 
this experiment in fact showed no effects of spacing, and the 
experiment v^/as therefore replicated, with substantially the same 
results, viz,, no effect. 



Method 

Sybjscts , In the first replication, 45 undergraduates at 
the University of Wisconsin, Madison, served as Ss, In the 
second replication, 48 students volunteered from the same source. 
All ^s were enrolled in an introductory psychology course, and 
received experimental credits in return for their services. Within 
each replication, Ss were assigned at random to one of three exper- 
imental groups, with the restriction that there be an equal number 
of ^s in each group. 

Procedure , Each S^ learned a 15-item paired-associate li^t by 
the a..tic1pation technique. Stimulus members were the low associa- 
tion CVC s from Exp, I; response terms were the set of 2-digit num- 
bers, Instructions and pretraining were identical to Exp, I, as 
was the general presentation procedure. There was no limit on 
response time, and there was no intertrial interval. 

The training procedure consisted of 6 anticipation trials 
followed by. a test trial (stimulus only), and then 6 more antici- 
pation trials follovi/ed by a test trial. Subjects were then given 
a poem to read for about 2 min,, and a final recall trial was 
administered. The three experimental groups differed in the rel- 
ative massing of subsets of experimental items during the two 6- 
trial anticipation series. Group x served as a control group; tfie 
15 items were presented in random order on each trial. In Group 
II 5 th6 list wds dividod into 3 S6ts o*f 5 itGitis 6dch« I*f thosG 
sets are labeled A, B, and C, then A comprised a control set, in 
that each of the 5 items was presented once on each trial. Items 
in sets B and C were presented in massed fashion; items in set B 
were presented twice on all odd-numbered anticipation trials, while 
Items in set C were presented twice on even-numbered anticipation 
trials. In Group III, 6 items were selected for set A, and the 
remaining 9 items were divided into 3 sets (B, C, and D) of 3 items 
each. As before, items in A served as control items and were pre- 
sented once per trial. Items in B, C, and D were presented three 
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times per triai, but each set was used on only one-third of the 
trials. Set B items appeared on trials 1, 4, 10 and 13, set C 
items on 2, 5, 9, and 12, and D items on trials 3, 6, 8 and xi. 
(This assignment of trial numbers and sets, which provides partial 
counterbalancing of sets and order of presentation, was used only 
in the second replication. In the first replication, sets were 
run in the same order in both blocks of anticination trials.' 

These assignment conditions achieve the following results: (1) 

each trial consists of 15 presentations; (2) each item is pre- 
sented for 6 anticipation trials in each of two blocks; (3) a 
spaced control baseline is available both between and within con- 
ditions. From Group I through Group III, there is a marked in- 
crease in massing of items within the list. Thus, in Group I 
(and for control items within each other group), the mean number 
of interpolated pairs is 14; in Group II, the corresponding value 
for experimental items is 6.3, and in Group III it is about 4.2. 

In none of the massed conditions was an item repeated with 0 
interpolated pairs. 



Results 

The primary results can be stated succinctly. Performance 
on the two test trials during training and the final recall test 
showed no effect on massing either between- or within-subjects. 

Mean number of correct responses on each test is presented in 
Table 3. In the first replication, the differences between con- 
ditions, which show a tendency toward better performance in Group 
II, are not significant, F(l,38)=1.2, p<. 10, and in the second 
replication, the means are almost identical at all test points. 

In Table 3 is presented proportion of correct responses for roaced 
and distributed items within Groups II and III. The difference 
between Groups II and III in the first replication approaches 
significance, F(l,28)*3.9, .05<p«cr. 10, but this effect is appar- 
ently fortuitous, since it is not apparent in the second replica- 
tion. The interaction between groups and spaced vs. massed **tems 
within groups in the second replication is significant F(l,150)-8.n, 
p^.05. Spaced items show somewhat better performance than distri- 
buted items in Group II, but this relation is reversed in Group III. 
The same trend is found in the first replication to a much lesser 
degree. However, there is no other difference, within groups, 
between spaced and massed items. 

Experiment HI 

This Investigation is concerned with the amount of interfer- 
ence produced by learned versus unlearned items. In the antici- 
pation technique, both input (study) and output (test) operations 
must be carried out by the subject, each of which has been shown 
to produce substantial interference (Tulving & Arbuckle, 1966). 
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Table 3 



Proportion of connect responses on retention 
test trials for Experiment II 



Replication 1 


(N = 15 per group) 


Group 


Test Trial 
7 14 


2 Minutes 
Post-Acquisition 


I 


.43 


.74 


.73 


II Spaced 


.52 


.84 


.83 


Massed 


.45 


.84 


.84 


III Spaced 


.35 


.68 


.68 


Massed 


.42 


.73 


.88 


Replication 2 


(N * 16 per group) 


Group 


Test Trial 


2 Minutes 




7 


14 


Post-Acquisition 


I 


.42 


.84 


.85 


II Spaced 


.54 


.86 


.86 


Massed 


.39 


.81 


.79 


in Spaced 


.40 


.88 


.88 


Massed 


.44 


.90 


.89 
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In the TDF model. It had been assumed that the largest part of 
interference-produced forgetting could be attributed to input of 
unlearned items. While this assumption is undoubtedly an over- 
simplification, it is of interest to determine the relative effects 
of learned versus unlearned items on acquisition of paired asso- 
ciates. 

Method 

Subjects . A total oi 24 students at the University of Wiscon- 
sin, Madi son , served as Ss, and were assigned at random to one of 
two experimental conditions. Each S^was paid $2.00 for his parti- 
cipation, which required from 35 to 80 min. 

Procedure . A second set of 15 low association CVC's were 
selected from Archer's (1960) list, by the same criteria used to 
pick the original set of 15 items in the first set. The list of 
two-digit response terms was also expanded to 30 items. The 
expanded list contained no double digits (e.g. 33), multiples of 
10, or digit reversals (e.g. 23 and 32). For groups of 4 Ss, two 
in each of the experimental groups, a list of 16 stimulus -response 
terms was selected at random from the pool of 30. Stimulus-respcns • 
pairings were randomly determined, as was presentation order, for 
6 blocks of 4 Ss each. Instructions, pretraining, and general 
procedure were the same as the preceding studies. 

In Condition L (Learned), Ss were given a list consisting 
of 12 pairs from the basic 16-item list for 8 anticipation trials. 
There followed, without interruption, a series of trials on which 
4 E (experimental) items, together with the 12 "learned" items, 
were presented until a criterion of 3 consecutive errorless trials 
was achieved for the 4 E items. Each E pair was presented twice 
per trial. Two of the pairs were vepeated on each trial with 2 
"learned" interpolated items, the othev* 2 .pairs with 4 "learned" 
interpolated items. 

In Condition U (Unlearned), Ss were given stimulus familiar- 
ization training on 12 stimulus terms by means of 8 free recall 
trials. Following this training, each S_ learned a 16-item list 
consisting of the 12 familiar items plus 4 E items, the latter 
repeated on each trial with either 2 or 4 "unlearned" 1 nterpc"**it' 3 d 
pairs. 



Results 

Mean trials to criterion for E pairs in Condition L was 3.5, 
while for Condition U this statistic was 5.5. This difference is 
statistically significant, F(l,22)=5.4, p<c.05. Spacing had no 
effect, nor did any of the variables interact. Hot all Ss had 
acquired the 12-item list by the end of the 8 training trials, and 
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there is evidence that degree of associative performance on the 
prelearned list is directly reflected in rate of learning for th'? 

4 experimental items. For example » in Condition L the rank-order 
correlation between number of errors on the last two training 
trials of the 12-item list and number of errors for E items on 
the second and third experimental trials was equal to .61 which 
differs significantly from 0 at the .05 level. That this corre- 
lation does not simply represent subject-selection factors (the 
better Ss might learn both lists more rapidly) is suggested by 
looking at the same rank-order correlation for Condition U, using 
number of items omitted on the last two free recall trials as the 
covariate, which yields a correlation of .11. Thus, it seems 
reasonable to state that the more completely learned were th iter; 
in the 12-item list, the less interference was produced and the 
more rapidly the performance criterion was achieved. 

Discussion 

The results of the first experiment extend the finding of 
others (Greeno, 1964; Izawa, 1966), that acquisition efficiency, 
measured by trials to criterion, is improved by increasing the 
spacing between successive reinforcements of a paired-associate 
item. A number of different theoretical ideas can be put for- 
ward to account for this result. 

The stimulus fluctuation model of Estes (Estes, 1955a, 1955b; 
Izawa, 1966) which was developed to handle distributional phen- 
omena in verbal learning, makes the general prediction that under 
spaced conditions, fewer presentations should be required to reach 
criterion. The principle assumptions of the model are: 

(1) The stimulus member of each item comprises a set of N* 
cues or stimulus elements, of which a subset of size N 
is available for sampling on each trial. 

(2) Individual elements move from the available to the un- 
available subsets (and vice versa) over time, the proba- 
bility of a transition occurring over any small time 
period M, being a constant value. 

(3) On each trial, the entire set of available elements is 
sampled, and with probability £, those elements in the 
sample not already associated to the correct respor.wO 
become conditioned. 

(4) Each element is either conditioned to the correct response 
or is unconditioned. Items remain in the conditioned stat 
once there. The probability of a correct response during 
a test is equal to the proportion of conditioned elei.ijiti- 
in the sample. 

A number of theoretical predictions can be derived from these 
assumptions which are applicable to the data from the first study 
in particular. In deriving these predictions, we will follow the 
notation introduced by Estes: 



% * probability of an incorrect response before the nth 
reinforced trial 

« probability that a given cue is conditioned after tne 
nth reinforced trial 

J * IT/N* or the proportion of elennents in the available 
sampling set 

a probability of an interchange of cues between the available 
and unavailable subsets during a single presentation inter- 
val . 

Two theorems, proven by Estes, are also useful. First, the proba- 
bility that a cue in the unavailable set during the presentation 
of an item is available k presentations later is J(l-a*^y). Second, 
the probability that a cue available during a presentation, is also 
available k presentations later, is J+(l-0)akt, Finally, it vdll 
be assumed for simplicity that the conditioning parameter, c, is 
equal to 1. 

From these assumptions, it can be shown that the underlying 
conditioning process takes the following form for Experiment I: 

® 9 

Fj = g+(l.g)J«(l-d)g+J 

F2 * Fi+(l.Fi)d(l-akt) m 

« l-(l-g)[l-J(l-akt)] 

F3 * F2+(l-F2)J(l-a(^^**^)^) 

* l-(l-g)(l-d)[l-d(l-akt)|[i.j(i-a(25-kJ^)] 

where is the number of interpolated items between the two presen- 
tations* on a trial. The change from one reinforcement to the next 
follows this pattern: F^j, the number of conditioned cues following 
the nth reinforcement, is* equal to F^-i, plus that proportion of the 
unconditioned cues on n-1 which werelinavailable on rnl, but became 
ava11able«on n^, viz, a proportion equal to J(l-aml^), where m is the 
number of other interpolated items between n-1 and n. The general 
form of Fo on even presentations, which is the value of F^ following 
the second* presentation on a trial for repeated items, is“" 

■"o * 3 

.1 n-2 

Fn * l-(l-g)(l-J)ll-0(l-akt)]T* H-0(l-a(25-k)1)]-7" , 

n » 2, 4, 6... 

*“*?i 



K 







The form of the equation for o^ presentations, i.e., following 
the first presentation on a tnal, is 

Fp * 1“{ 1-g) ( 1“J) 1-a^^) 1 ll-J( 1-a^^^ k)t) ^ ^ 

[3] 



n « 1, 3, 5... 



The probability of an error on the test trial immediately pre- 
ceding the nth reinforcement is equal to the probability that 
each individual element is In the unconditioned state, times the 
probability that the each element which was in the available sam- 
ple on the preceding feedback event (and hence was in the conditioned 
state) has moved to the unavailable state during the intervening 
time period (cf, Izawa, 1966, p. 912 for a more detailed derivation). 
Hence 



rERlC 



% = (l-FnHl-a"'*) 

where m is the number of interpolated items since the last study 
item. For even values of n, i.e., for the first test presen’ “^'on 
on each trial, m is equal to 0, 1, 2 and 3 for the various groups 
of items. For odd values of n, m is equal to 25, 24, 23 and 22 fo 
the same respective groups. By summing the error probabilities 
over all trials, a theoretical expression for total errors to cri- 
terion on the first and second presentations on each trial for re- 
peated items can be obtained: 

Ti = ( 1-9) ( 1-J) ( l-a(25-k) t) j i-j( i-akt) ] 

1- [1-J( 1-akt) ] [1- J( l-a(25-k) t) j 

T2 = (l-9)(l-J)(l-a'^t) [5] 

l-ll-a(l-akt) ) (l-j(l-a(25-k)t) j 

Finally by summing Tj and T 2 , it can be shown that overall total 
errors to criterion, T, should be constant over all spacing condi- 
tions. 



T = Tj+T 2 » (1-g) 

J 



[ 6 ] 



The effect of increasing the number of interpolated items is to 
increase the errors on the second presentation on a trial, 
there should be a corresponding decrease in the number of errors 
on the first presentation. In fact, total errors in the H and 0 
item groups were equal, as predicted. However, the number of total 
er^rors In the ^ and 2 ^tem groups were significantly greater 
than the other two groups. Estimates were obtained by a least- 
squares procedure for the Low and High association lists. It was 
found that the parameter, a^, was constant over the two associatior. 
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levels at a value of .52. The parameter J was equal to .16 in the 
Low group, and to .20 in the High group. The fact that material 
of high association value has a higher proportion of cues available 
for sampling could arise either because more cues are available on 
each trial from a base set which is invariant in number over asso- 
ciation levels, or because with meaningful material, a smaller 
base set of cues Is necessary for adequate performance, while the 
number of cues ir. the available sample is constant over associa- 
tion levels. Without a much more detailed analysis, which is not 
appropriate in view of the failure of the basic prediction of con- 
stant total errors, it is not possible to choose between these two 
alternative interpretations. 

These parameter values were used to predict the eighteen 
observed statistics in Table 2. As may be seen, the fit is not 
very good for either first or second presentations in item groups 
1^, 2 and reflecting the fact that the prediction of constant 
total errors is not met by the data. The error in excess of the pre- 
dictions are shared about equally by the first and second presenta- 
tions. 

A second theoretical analysis, which rests on a two-component 
memory (i.e., postulation of separate long-term and short-term 
memory processes), assumes that for a transition into long-term 
memory storage to occur, an item must have been forgotten sij.Cvi 
the previous reinforcement. Greeno (1966) presents a Markov modol 
for verbal learning in which on each presentation, an item can be 
in one of three states, L (long-term memory), S (short-term memory) 
and F (forgotten). The general form of the model is described by 
the transition matrix and column vector of response probabilities 
below. 



State on 
trial n 



State on trial n+1 



Pr( Correct I row state) 



L 

S 

F 



L S 

"1 0 

c (l-c)h 
d (l-d)h 



P 

0 1 
( 1-c) ( 1-h) 
(l-d)(l-h) 




As mentioned earlier, Greeno presents evidence from several 
experiments in support of the statement that transition to L occurs 
only from F, i.e., £ = 0. In order to apply the model with this 
restriction (c * 0) to the present study, let h = l-(l-x™), where 
m is the number of interpolated items since the last reinforc'^mep'^ . 
(That is, h is a decreasing exponential function of m, such that 
following zero interpolated items, a pair remains in S with 
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probability 1, while after a large number of items, a pair is 
very likely to have been forgotten.) On the first presentation 
of each trial, hi = l-(l-x25-k), which will differ negligibly from 
0 in the present study (e.g., consider Peterson & Peterson, 1959). 
Following the second presentation, h^ * , , where is the 

number of interpolated items. If tlJe state probabilities are 
represented by lower case letters subscripted with the presentation 
numbers, then the probability of a correct response on the r^th 
presentation, and the state probabilities are described by the 
following equations. 



Pn * -^n ^n 9‘lf^n 

■^n ~ ^n*l ^fn-1 

So * ^liSo-i + ®^i(l-d)fo-l 

fn = (l-hi)so-l + (l-d)(l-hi)fo.i 

where i is 1 on even presentations and 2 on odd presentations. It 
can be shown by induction that for even values of n, (on the first 
test presentation of a trial) 

f„ = {(l-d) [l-d(l-h2)l)"-l 

Sn = 0 

For odd values of n, the state probabilities are 

= (l-h2)(l-d) {(l-d) [l-d(l-h2)lj 

s„ = hgd-d) {(l-d)U-d(l-h2)]}-"*^ 

These equations can be used to derive the expected total errors on 
the first and second presentations. Except for labeling of the 
parameters, the expressions are identical to the stimulus fluctua- 
tion model, and by inspection of the transition matrix, it can be 
seen that prior to entering L, the process will have to be 1r» F 
for 1/^ trials, on each of which an error is 1-g, so that total 
number of errors over the first and second presentations is again 
predicted to be constant over all spacing conditions at 



[9] 

[ 10 ] 
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d 



[ 11 ] 



Another model, which involves incremental acquisition and 
forgetting processes, is described in Section 3. This latter 
model, which was developed to handle list-length variation in 
paired-associate learning, does not predict constancy of total 
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errors, but instead comes close to predicting the observed pattern 
of total errors. The model is not without its own^problemSt how- 
ever, and these will be considered in some detail in Section III. 

To anticipate one aspect of that discussion, a problem which 
arises in studies of the effects of spaced presentations on paired- 
associate learning is that, when test trials are given, with in- 
creased spacing there is a concomitant increase in the rate of 
errors on tests. It may well be that the deleterious effects of 
massing arise because the subject is not receiving appropriate 
feedback about the adequacy of his storage strategies with massed 
presentations. I.e., errors may provide information to the sub- 
ject that he hasn't learned particular S-R pairs and needs to do 
additional encoding. Evidence on the role of active ^responding 
(and, in particular, of errors) on acquisition of paired associates 
is sparse. It has been shovm that an uninterrupted sequence of 
study trials, during which the subject makes no responses, and 
hence receives no information about the effectiveness of his 
storage operations, is a relatively inefficient technique (Izawa & 
Estes, 1966). Data from a study now in progress (Watters, 1966) 
in which the subject times his own feedback interval under an 
anticipation procedure show a big difference in intervals following 
precriterion successes and errors, the latter intervals being much 
longer. These findings suggest the importance of further investi- 
gation of the role of the response in learning verbal associates. 

For the present, theoretical models based on spacing effects, short- 
term memory effects, or the idea that effective feedback is more 
likely to be associated with errors than correct responses, cannot 
be differentiated from one another on the basis of available data. 

The failure to find any significant difference in Experiment II 
as a function of massing of items either within or between subjects 
seems surprising in view of the results of the initial study. How- 
ever, the greatest decrement in efficiency due to massing is seen 
with zero interpolated pairs. Even in Group III of Experir^nt 
where massed items are presented three times per trial, the randor- 
ordering was constrained so that an item was never repeated twice 
in immediate succession. The number of presentations per trial was 
also shorter in Experiment II (15 as compared to 27 in Experiment I, 
Even taking these factors into account, the result is unexpected. 
Based on the low association data (presentations to criterion) from 
Experiment I together with the average number of interpolated items 
separating successive presentations of massed and spaced items in 
the second study, one would predict that the massed items would 
require about 8.8 presentations to reach criterior«, compared with 
about 7.4 for spaced items. At none of the test points for either 
replication is there any indication of superior perfo»nance for the 
spaced items. At the same time, the short-term retention effects 
of massing can be observed by a decline during training trials in 
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the probability of ao error over successive repetitions within a 
trial. Averaged over all trials, the probabilities are .65 and 
.55 for Group II, .62, .44 and .29 for Group III. These values 
may be compared with the corresponding presentations of spaced 
items, .57 and .51 for Group II, .69, .59 and .54 for Group III: 
Comparison of these two sets of values shows a much greater decline 
over pairs (or triplets) of presentations for the massed items 
versus the spaced items. Apparently, as long as, variation in the 
degree of spacing of items does not exceed certain bounds, the 
subject is able to allocate his associative efforts in such a 
fashion as to learn all items with about equal efficiency. A 
complete specification of these boundaries is not possible from 
these data, but one significant limit is irrediate rutest 
and reinforcement of a previously presented item. Such an opera- 
tion is simply a waste of time. 

The results of the final study show that a major source of 
interference in verbal learning is produced by unlearned items in 
a list, and, by implication, results from the active processing 
and storage of information by the subject. Using a short-term 
retention design, Tulving and Arbuckle have shown that signifi- 
cantly greater interference results from input or study presenta- 
tions as compared with output or test presentations. ^Posner and 
Rossman (1965) have also found that as the amount of information 
processing required during an interpolated task is increased, 
higher rates of retention loss occur in short-term memory. The 
design used in Experiment III was such that not all the items in 
the prelearned list were well-acquired prior to training on the 
experimental list. In other words, some of the ‘“learned”* items 
had not in fact reached criterion. Four subjects in Group L^had 
reached a criterion of one perfect recitation of the first list. 
Trials to criterion for the experimental items for these subjects 
were 2, 3, 4 and 4.5, which don*t seem unreasonable values for 
learning a 4-item list. The four best subjects on the preliminary^ 
recall task in Group U required 4, 7, 9 and 14 trials on the experi- 
mental items. 

An alternative interpretation, to be contrasted with inter- 
ference-produced forgetting, is that unlearned items are .ntified 
by the subject and "selected*' for learning. Thus, on any trial the 
subject's attention is limited to a relatively few items. In Group 
L, the selected items are E items, whereas in Group 0, any of the 
items may be chosen. Battig (1966) has suggested that subjects can 
identify and respond in a differential fashion to learned items in 
a paired-associate list. In the present experiment it might be 
reasonable to suppose that subjects isolate unlearned items. 
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Section III: Incremental acquisition and forgetting processes in 

paired associate learning 

In studies of verbal learning, acquisition rate typically 
decreases disproportionately with total amount of material (McGeoch & 
Irion, 1952, pp. 487-496). For example, mean trials to criterion 
is greater as the number of S-R items in a paired-associate list is 
increased (Carroll & Burke, 1965; Runquist, 1965, 1966). This 
decrement in learning rate has been attributed (Atkinson & Crothers 
1964; Cal fee & Atkinson, 1965) to the additional interference which 
nature ny occurs with increased list length. As represented In the 
trial -dependent-forgetting (TDF) model, each reinforced presentation 
of an S-R pair has the effect of moving that pair either to the 
long-term (L) or short-term (S) memory states of a Markov process. 

either of these states, the correct response occurs 
with probability 1. Every time some other pair is presented, as a 
consequence of the interference produced, some items in short-term 
memory may be forgotten, which is represented in the model as a 
return to an unlearned (U) state. The overall interference in- 
creases with longer lists, leading to the poorer performance which 
is usually found. 

In a study in which paired-associate lists of three different 
lengths were used, the TDF model in a slightly revised form gave an 
excellent account of the data, compared with several alternative 
formulations. Nonetheless, in two respects, this model is less 
than satisfactory. First, the estimated parameter values impV' 
pat following a reinforced presentation learning (i.e., transitior 
to state L) is far more probable if an item is in state S than sta:* • 
u prior to the feedback presentation. An optimum schedule would 
therefore involve massed presentations of each item in the list, 
contrary to results obtained by Greeno (1964) and Cal fee (1966b). 
Secondly, state L is an absorbing state in which probability of a 
correct response is unity. The model thus predicts that if one 
list is learned to a strict criterion, and then a second list is 
learned, subsequent test performance on the first list should 
show no retention loss. In fact, considerable forgetting is usuall ' 
observed under these conditions. One might argue, of course, that 
such a procedure exceeds the boundary conditions under which an 
absorbing-state model can be expected to hold up. 

The TDF model is a special case of a class of three-and four- 
state Markov models for verbal learning in which the assumption is 
made that associations are formed by transition through two or 
three discrete stages, in each of which probability of a correct 
response takes on a constant value (Atkinson & Crothers, 1964; Bern- 
bach, 1965, Greeno, 1964; Restle, 1964). An alternative represen- 
tation of the learning process is the random-trial -incremental 
(RTI) model of Norman (1964), in which each pairing of stimulus and 
response is effective with probability £, and is otherwise ineffec- 
tive. Effective feedback events produce a linear increment in the 
probability of a correct response, while ineffective events leave 
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the probability unchanged. The probability of a correct response 
following the nth reinforcement, Is therefore a linear function 

Pn-1» 



Pn “ A(Pn_i) > c(xPn.i+l-a.) + (l-c)p„.j 
“ [l-c(l-a)lp„.i+c(l-«) 

where A Is the acquisition operator. This model was compared to 
various alternatives. Including both Markovian and linear-operator 
processes (Atkinson & Crothers, 1964), and proved more adequate 
than other linear models, but generally yielded poorer predictions 
than the long-short Markov model which was presented. 

The RTI model may also be arrived at In terms of a dual- 
process memory. Suppose that a stimulus -response pair can be 
represented In memory by a set of distinctive features. The prob- 
ability of a correct response depends on the number of features whic.i 
have been stored In long-term memory and are available for reference 
when an Item is presented. On some feedback presentations, no effort 
Is made by the subject to transfer features to permanent storage, 
perhaps because of attentlonal factors. On other presentations, tii-^ 
subject affects a transfer of some number of features; the number 
of features transferred Is assumed to be proportional to the number 
remaining to be transferred, which implies a linear operator If the 
total number of features Is very large. 

In its original form, the RTI model cannot handle list length 
effects without additional parameters for each list. By suitable 
modification to take Into account the effects of Interference pro- 
duced forgetting, the model can be extended to account for variations 
In list length, and Its performance relative to Markov models Is 
also substantially Improved. Further, the revised model can make 
predictions about overlearning and distribution effects, as well 
as retention losses due to learning of Interpolated lists, which 
are generally consonant with experimental findings. 

The basic modification proposed Is to assume that when a par- 
ticular stimulus-response pairing takes place, the entire feature 
list for that pair Is stored in short-term memory. With probability 
£, the also engages In an encoding operation, in which some of th»' 
features are transferred to long-term memory. Following the fecdb- c!c 
Interval, there Is a gradual loss of information stored In short- 
term memory about this particular pair, because of the limited capa 
city of short-term memory and the need to process the other Items 
being presented. This loss Is also assumed to take the form of a 
linear operator, which Is a function of the number of Interpolated 
Items, where the limit of the forgetting operator Is determined by 
the acquisition operator. Specifically, the probability of an error 
for a particular Item on trial £ following m Interpolated Items, anH 



19 



o 



Ineffective feedback events, where f is the forgetting parameter, is 



The process is presented graphically in Fig. 1. 

Although the process may appear somewhat involved, derivation 
of various statistics is remarkably straightforward. This p»esen- 
tation will be somewhat abbreviated; for more detail on the RTI 
model, see either Norman (1964), or the treatment in Atkinson, 
Bower, and Crothers (1965). Consider a particular item for which 
there has been Ineffective reinforcements through trial iv-U ^nd 
let Sn-i be thelndicator random variable for the nuntber of effec- 
tive reinforcements on trial n-1. Then it can be shown that S , 
has the binomial distribution”with parameter £, — 



The rth raw moment of the distribution of response probabilities 
on trial n, is therefore 



The terml jy and the sums on i^ and j take into account the possible 
distribution of interpolated pairs between the presentation of an 
item on trials n-1 and £, respectively (cf. Calfee & Atkinson, 

1965, for additTonal detail). There can be between 0 and X-1 other 
pairs following the presentation on trial £-1, and each of these 
events has probability 1. A similar analysis holds for the number 
of interpolated pairs preceding the presentation on trial £. The 
sum on Intakes into account the number of effective reinforcements. 

Making the appropriate substitutions in Eq. 15. 



In the case of the first raw moment or mean learning curve, th * 
expression reduces to 
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Pre'-i-ntation nunber 



For very long lists, the function approaches the RTI model in the 
limit. I.e., the short-term memory is of no help in improving 
performance, because the probability that any information remains 
in short-term memory between one presentation and the next is 
vanishingly small. 

This model, which will be referred to as the RAF model (random- 
tn‘als-acquisition and forgetting), has several properties in"lcommon 
with the TDF model. Acquisition due to feedback and forgetting 
associated with interpolated items are both explicitly accounted 
for. The model is semi -Markovian, in that if the proportion of 
features stored during effective feedback events is reasonably 
large, then there will be a small number of states v/hich can be 
differentiated on the basis of response probability. Thus, if 
ocis fairly small, e.g. less than .25, then after tMO or three 
effective reinforcements, the probability of a correct response is 
so close to 1 that subsequent feedback events may produce no 
measurable change in performance, and the subject may appear for 
all intents to have reached an absorbing state. Performance may 
exceed learning to a substantial degree with short lists, where 
there is little interference-produced forgetting between successive 
presentations. However, the effects of feedback presentations are 
not washed out by subsequent Interference; instead, performance 
declines to the level of the basic acquisition process. For example, 
suppose a subject is given five trials on a 4-item list, by which 
point performance should be almost perfect. Then five trials are 
given on a second 4-item list, followed by a test trial on the first 
list items. According to the RAF model, during the second-list 
learning, probability of a correct response on the first list items 
will decline to 1-qj^ [l-c(l-oOl . In the TDF model, on the other 
hand, first-list items in state L will remain there during the 
learning of the second list, while any item not in state L will 
return to state U, or a chance level of performance. Finally, 
the RAF model makes the rather interesting prediction that the 
effects of variation in list length should appear only during 
acquisition performance. If a list of length X is presented for 
reinforced trials, and a suitable delay involving some kind of 
rehearsal-preventing activity is interpolated, then on a test 
following the delay, the proportion of correct responses is pre- 
dicted to be a. constant, invariant with changes in X, and equal to 
l-qi ll-cd-eO*^. 

The relative adequacy' of the TDF and RAF models was determined 
initially by comparing their performance on two sets of paired- 
associate data from experiments in which list length was varied. 
Experiment SH is reported in detail in Calfee and Atkinson (1965). 
Briefly, each subject learned a list of 9, 15 or 21 pairs, in which 
the stimulus members consisted of two-digit numbers, and the responis 
terms were one of the three nonsense syllables, RIX, FUB, or GED. 

The list was presented in a standard anticipation mode for a total 
of ten trials with no breaks between trials. 



Experiment RH involved more difficult stimulus materials, 
four rather than three response terms, and list lengths of 8, 16, 

24 or 32 items. Otherwise, the basic procedure is quite similar 
between the two studies. The stimulus items in the second study 
consisted of 32 four-letter Russian nonsense words. Each of *he 
31 letters in the Russian alphabet was used with approximately 
equal frequency, and appeared no more than once in a given word. 

The basic list of 32 items was divided into four sublists of 8 
items each so as to yield minimal intralist similarity in the 
judgment of the experimenters. Within each sublist, the first and 
last letters are different for each item in the list. The responses 
consisted of the digits 1 through 4. The subject responded by 
marking the appropriate column on an IBM mark-sense form. Two 
random assignments of each response term to two stimulus items 
within a sublist were prepared. A total of 16 subjects were run 
under each list length condition, either 8, 16, 24 or 32 pairs. 

Each sublist was used equally often for every list length, as were 
the two random stimulus -response pairings. The number of trials 
was fixed within each condition, an attempt being made to chose a 
number which would reduce error rate on the last trial to about .25. 
The number of trials for 8, 16, 24 and 32 pairs was 9, 12, 14 and 
15, respectively. Subjects were given instructions about the task 
and a brief pretraining list to familiarize them with the procedure. 
After brief rest, the experimental list was presented. Stimuli were 
printed on 3x5 cards. The first four items were dumny pairs, which 
did not appear thereafter, and were included to eliminate any pri- 
macy effects. Responses were self-paced, in that subjects were 
given as long as necessary to record their answer. After the 
response, the correct answer was then pronounced by the experimenter, 
and three seconds later, the next stimulus card was presented. The 
stack of cards was prepared beforehand, and kept behind a shield, 
so that the subject could not see how much longer he had to go. 

The mean learning curves for Experiment II are presented in 
Table 4. There is a substantial effect apparent as list length 
increases from 8 to 24 pairs; the further increment to 32 pairs 
produces little additional change in the performance curve. The 
hypothesis of precriterion stationarity based on Vincent quartile 
(Suppes & Ginsberg, 1963) can be rejected at better than the .001 
level for all lists. As may be seen from the quartile data which 
are presented in Table 5, the functions show a tendency to flatten 
out at the shorter list lengths. 

The data used in parameter estimation and primary evaluation 
of the models were 4-tuple sequences of successes and errors for 
individual stimulus -response pairs on trials 2 through 5 and 6 
through 9. The 16 possible combinations of successes (s) and 
errors (e) for each of these four- trial blocks are listed in Table 
6 for the three groups in Experiment SH, and Tcble 7 for the four 
groups in Experiment RH. 






23 



Table 4 



Probability of an incorrect response, trial by trial 

Experiment RH 



Trial 

1 

2 

3 

4 

5 

6 

7 

8 
9 

10 

11 

12 

13 

14 

15 



List Length 




24 
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Table 5 



Probability of an incorrect response 
precriterion in Vincent quartiles. 
Experiment RH 

List Length 



Vincent 

Quartile 


8 


16 


24 


32 


1 


.67 


.74 


.76 


.77 


2 


.57 


.71 


.73 


.73 


3 


.64 


.69 


.72 


.71 


4 


.62 


.66 


.66 


.65 
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Observed and predicted frequencies for 4-tr1al response sequences, trials 6-9, Experiment RH 
8 Items 16 Items 24 Items 32 Items 
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For each of the models, the theoretical expression for the 
probability of each four-trial sequence was determined. Following 
Atkinson and Crothers (1964), let 0^ i n ith^f our- tuple for 

Group i where the sequence begins on’tflal n. Let N(04 j n) be 
the observed frequency of this sequence, and N(0i . „;pj 6e the 
predicted frequency for a particular choice of th^*^ parameters, p., 
of the model. Then define the function 



A measure of the discrepancy between a model and the data froi.. 
Group is 



A discussion of the properties of this statistic can be found in 
Atkinson, Bower and Crothers (1965), and in Holland (1966). 

Briefly, the quantity is distributed approximately as chi 
square with 30 degrees of freedom, disregarding loss of df due 
to parameter estimates for the moment. 

• i estimates were found for each experiment which 
yielded minimum values of the sum of X^<s over all groups within an 
experiment. (The parameter search was carried out to three decimal 
places on a computer using a search program for parameters for 
non-linear equations developed bv the Mathematics Research confov*. 
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Table 8 



Parameter estimates and values 
for Experiments $H and RH 

Value 











Trials 


Trials 


Total 




Model 


Parameter 


2-5 


6-9 




Exp. SH 


TDF 


a 


.42 


49.6 


65.9 


115.5 






b 


.11 












f 


.19 










RAF 


a 


.09 


60.7 


90.2 


150.9 






c 


.21 












f 


.15 










RAF2 


a 


.008 


47.5 


66.5 


114.0 






c 


.20 












f 


.56 








Exp. RH 


TDF 


a 


.33 


73.8 


147.2 


221.0 






b 


.07 












f 


.25 










RAF 


a 


.13 


81.2 


126.7 


JL07.9 






c 


.05 










f 


.20 










RAF2 


a 


.009 


68.5 


97.8 


166.3 



c .09 
f .65 
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Table 9 



Observed and predicted mean total errors 
on trials 2 to 9, Experiments SH and RH 





9 Items 


Obs. 

1.21 


TDF 

1.29 


RAF 

1.65 


RAF2 

1.53 


Exp. 




15 Items 


2.03 


2.07 


2.02 


2.10 


SH 




21 Items 


2.50 


2.58 


2.16 


2.36 





8 Items 


3.11 


3.10 


3.72 


3.47 


Exp. 


16 Items 


4.41 


4.51 


4.57 


4.57 


RH 


24 Items 


5.10 


4.92 


4.72 


4.76 




32 Items 


5.00 


5.07 


4.77 


4.84 
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of forgetting produced by each Interpolated item should be pro- 
portional to the probability of an error for that item. In the 
original version of the model, the probability of an error on 
trial n^ after m interpolated items and Ineffective reinforcements 
was 






[ 20 ] 



In the revised model, the equation for q|(,m,n same, except 

that the exponent of the forgetting process! im-l, becomes (m-l)Vi 
which alters the forgetting function in the desired fashion. * * 



A second modification arose from the following considerations. 
As the number of items in a list becomes greater. It should be 
necessary to store in memory more features to allow successful 
retrieval of the correct response. For larger lists, there is 
almost always a corresponding increase in formal intraJist sim- 
ilarity because of limits on the number of distinctive units, i* 
the present case, alphabetic characters. If the number of featu.e' 
stored on each effective feedback is more or less constant, then 
the requirement of a larger total number of features for longer 
lists implies thatc^, the rate parameter for the acquisition pro- 
cess, should be larger for longer lists. (The parameter oc is in- 
versely related to speed of learning; the larger is, the slower 

acquisition occurs.) The specific assumption is that o( is an expo- 
nential function of list length, X, 



These tv^o modifications only slightly affect the mathematical 
tractability of the model. Although the enuations for the learn- 
ing process take cn a more formidable appearance, only minor 
alterations were necessary in the theoretical expressions used 
in the parameter-search computer program. The predicted fre- 
quencies for the 4-triple response sequences from the revised 
models RAF2, are presented in Tables 6 and 7; parameter estimates 
and X values are in Table 8. As may be seen, the fit of the 
model is substantially improved with these modifications. 
(Analyses of the effects of each of the modifications in turn 
shov/ed that each modification contributed about equally to thj 
reduction inX •) However, the model continues to deviate sys- 
tematically from the data in that too many errors are predicted 
for short lists, too few for long lists. 
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Two other extensions of the RAF model have been considered, 
but no final results are available at the present time because 
of problems In carryinn out the mathematical analysis. First, 

If the Idea about acquisition of verbal material by storaoe of 
discrete features Is to be taken seriously, then a more appropri- 
ate formulation v/ould be to replace the linear operator process 
with an M-element pattern process (Estes, 1959; Atkinson & Estes, 
1963), (The feature-list process has been sunoested In one form 
or another by numberous Investigators, e.q,, Atkinson & Shiffrin, 
1965; Bov;er, 1966; Felgenbaum, 1963). As fl, the number of fea- 
tures or patterns, becomes small, there are significant chances 
In the nature of the theoretical acquisition process. For ex- 
ample, the limiting form of the proposed pattern-RAF model is 
the one-element all-or-none model, whose properties are well 
knovm (Bower, 1961) and differ considerably from the RAF model. 
(Actually, a more suitable one-element analog to the RAF model Is 
the TDF model, since the one-element all-or-none model does not 
take Into account short-term memory and Interference-produce*^ 
forqettlnn.) The feature-list process has been worked out for 
the two limiting cases, r!*l and The derivations for inter- 

mediate values of n are presently under Investigation, but the 
derivations have not been completed. 

The second extension concerns the assumption that learning 
or storage of Information Is more likely to occur fol levying an 
error than a success. One approach would be to assume that 
effective reinforcements occur only on error trials. The result- 
ing model is a two-operator linear process. The forgetting oper- 
ator Is experimenter-controlled (Bush & r^ostellor, 1955) In the 
sense that the subject's responses do not determine the number 
of Interpolated Items, The acquisition operator Is subject- 
controlled, according to the relation, 

%+l * If an error on n 

%+l * ^n ® success on n 

This particular model Is not too difficult to work v/1th, and 
some analyses are presently underway. It appears that this modi- 
fication of the RAF model will take care of a problem first noted 
by Atkinson and Crothers (1964) and also found In the two list 
length studies, viz,, the random-trlals-incremental process pre- 
dicts too many errors during the first few trials. The modifica- 
tion has the effect of Increasing acquisition rate during the 
early trials. However, this form of the model also predicts 
constancy of total errors over the various spacing conditlonr of 
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Experiment I In the precedi nc' section. Hence* a more Interesting 
extension v;ou1d be the case where learnlno occurs following both 
successes and errors, but with different probabilities of occur- 
rence, This process Is currently under study. 

next, the application of the basic RAF model to the error 
data of Experiment I In Section I v/111 be considered briefly. 

From Eg, 17, It can be seen that on the first presentation on 
each trial, the probability of an error Is 



» qj Il-cO- , i = 0, 2, 4 



t t t 
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since viHh more than 20 Interpolated Items from the last presen- 
tation of the item. It Is likely that short-term retention of the 
pair vn'll be negligible. Hence, total errors to criterion on 
the first presentation for repeated Items will be 



° h Si 



(22J 



1 - ( l - c { 1-«)]|2 



The model makes the strong prediction of constant errors to 
criterion on the first presentation for repeated Items, More- 
over, total errors for the once-per-trlal Items should be 
qi/ \1 - [l-c(l-oO] Implying that there should be noticeably 
more errors to criterion for the bnce-per-trlal Items than for 
the*;f1rs‘t preseotatlon on repeated Items, Total errors to cri- 
terion from the second presentation for repeated Items can be 
found In a similar fashion to be 



To = 



"iji- 

L itf Jjd- 

■ I 

■ ki -if ; 






■c{1-«)]^l-c(l-o0l 



i-1 



qi |l- 



[23] 






where jc is the number of Interpolated Items between the first 
and second presentations. Parameter estimates of c(l-i)0 and f 
viere obtained by a least-squares procedure, and the predicted 
total errors are to be found in Table 2, It is obvious that 
total errors from the sum of first and second presentations 
should Increase with increased spacing, according to the model. 
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In this process, the number of presentations required to reach a 
reasonably stringent criterion is constant, independent of spac- 
ing. The effect of massing is to reduce total errors to criterion; 
this increase in correct responses prior to reaching criterion 
reflects short-term retention. The failure of the basic model to 
account for increases in presentations to criterion vjith greater 
massing results from the assumption that effective feedback is 
equally likely every time a stimulus-response pair is presented for 
study. Comments made previously about modification of this char- 
acteristic of the model (e.g, differential probabilities of con- 
ditioninn after successes and errors) are relevant here also. 

The final study in this section concerns the use of learning 
models to specify optimal strategies for presentation of verbal 
material. Specifically, a two-operator linear model is examined 
v/ith reference to the optimal allotment of presentation time to 
the tv/o halves of a "split" paired associate list. This viork is 
an extension of Suppes* (1964) investigation of optimal block size 
in paired-associate learning. 

Suppose a list consists of m stimulus-response nairs divided 
into blocks of k items, each blo^^ to be presented n times. The 
problem v/as to Tind that value of jc which yielded tHe highest 
number of correct responses on a test following the training series 
Suppes* finding based on the two operator model described below, 
v/as that if learning took place at a more rapid rate than for- 
netting, then the block size should be as large as possible; i.e,, 
choose k » m. If forgetting occurred more rapidly than learning, 
then the block size should be as small as possible, k * 1, The 
first procedure has been termed the whole-list method, and much 
available data suggests that this method is more efficient. 

The present paper represents an extension of Suppes* orioinal 
paper, in which the requirement that each item be presented a 
fixed number of times is relaxed. Suppose the material to be 
learned consists of a list of 2m items, in v/hich the first block 
of m items is presented for ^ trials. On each trial, the m items 
are all presented in random order. The second block of m it'^ns 
is then presented for T-t trials, following vjhich a retention test 
of the entire set of 2m items is administered. The problem is to 
find a value of v/hich yields a mean error probability during 
the test which is as small as possible. Since interference pro- 
duced by the learning of items in the second block may lead vo 
considerable forgetting of the first item learned, it would seem 
intuitively that the items in the first block should receive some- 
what more than half of the training trials. The analysis below 
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substantiates this conjecture, and indicates hov; the choice of 
depends on the learninp and forgetting rate parameters. 

It is helpful in reaching a solution for the more complex 
case. to consider first the very simplest situation, viz,, a list 
of two items. The first item is presented for jt trials, and the 
second for T-t, Assume that acguisition and forgetting processes 
may be described by a tv/o-operator linear model. On each rein- 
forced training trial for item 1, the probability of an error is 
reduced by applying an acquisition operator Oj^, 



whereas each time the other item is presented, the probability of 
an error for item i is increased by applying a forgetting operator 

Or, 



where q is the initial error rate. The operators 0. and Or are 
first-order linear difference equations of the form^ ^ 

Inhere R and S are constants. The well-knov/n solution to this 
process is 



The derivations in this paper are based on this result in large 
part, and intermediate steps are omitted for brevity. The deri- 
vations generally follow the pattern in Suppes* paper (1964) 
to which the reader is referred for further detail. 

This two— operator model is sim.ilar in some respects to the 
RAF model, 1\#o Important differences are (1) c « 1, so that 
every reinforcement is assumed to be effective, and (2) the 
forgetting operator has the effect of reducing response proba- 
bility from the acquisition level reached following a reinforce- 
ment to the base guessing level. Work is presently underway 
to Investigate the properties of the RAF model with regard to 
block size and split list presentations. Since the forgetting 
function affects only training performance and not long-term 



"nn+l = "l ' afl.n 
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retention t then the basic model must predict no effect of block 
size. For the RAF model revised so that learning is more likely 
* then whole list learning should be somewhat more 
efficient in any circumstance. The relative inefficiency of var- 
ious part-list procedures remains to be determined. 

For the special case of a two-item list, following t trials 
on^item 1, qg ^ will still be equal to while qi ^ will equal 

^ O’ the’end of training, the error probabilities for the two 
Items will be 

E(qi) = = [l-(l-a^)bT-t]q 

E(q2) = \^‘*(q) = a^'^q [28] 



Hence the expected number of errors on the posttest will be 

E(q) « (1-b'^*'^ + a^ + a*^“'*')q [29] 

The value of t which minimizes E(^ can be found by standard nethoc';; 

find dE/dt, set the derivative equal to zero, and solve for t. From 
Eq. 5 we have 



dE 

it 



~ -jb"^ [b*'®' log b + t|a_j log ^ q 

[30] 



Setting this derivative equal to zero yields the equation 
1 - a^ = [ 



f T-t - a^J Aog a'\ 
1 b/ \iog hf 



[31] 



A concerted effort to find a solution in t for this equation 
was unsuccessful. However an approximate solution was arrived at 
by noting that if a is not too close to 1 or t is relatively large, 
the ^ term, ai^, may be small enough to be disregarded. Assuming that 
a is negligible, then the optima! value of t is 



'op 



° teti) 

log b - log a 



[32] 



To the extent that the approximation holds, then for any value of a 
and b, the number of trials which should be allotted to the first 
item is a linear function of T. Moreover, the number of trials 
remaining to be allocated to the second item is a function of a 
and b, and is independent of T. 
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As a tends toward 1 (slower learning), then item 2 should receive 
a larger share of the trials, whereas if this parameter is close 
to 0 (rapid learning) a single presentation of item 2 may be opti- 
mal. On the other hand, if b is near 0, then it may be the case 
that item 2 should not be presented at all, since any presentations 
of this item wip lead to virtually complete loss of the newly 
formed association for item 1. (There is no reguirement that the 
two items be learned equally well.) If b is near 1, then more 
trials may be allotted to the second item, because interference 
will be minimal. 



^Turning to the more general problem, assume that the list 
consists of 2m items, where m>l. The items in a block are pre- 
sented in random order on eaBi trial, so that a particular itpm 
may be the first, second, ...mth item presented on the trial. 

Hence, between successive presentations of a given item, 0 to 
2(m-l) other items may be interpolated; on the average m-1 items 
are interpolated between successive presentations of a given item. 

A precise denvation of the expected error rate following training 
would involve computing the error rate for each random sequence 
which might occur, and finding the expectation over sequences. 

This derivation is very cumbersome, and so as an approximation we 
will find E(q) based on the average location of an item in the 
sequence.^ That is, consider an item which is presented in the mid- 
dle position on each trial as indicated in the diagram below for 
block I where X represents a reinforced presentation of the “'aver- 
age®' item, and 0 represents presentations of other items: 



Trial 1 



Block I 

C-. 

Trial 2 




Block II 



Trial t-1 Trial t'' 



0—0 

nPl m-1 

items 2 

items 




(T-t)m 

presen- 

tations 



From this description of the presentation sequence, it can' be seen 
that the ^probability of an error for an item in block I at the end 
of training will be in terms of the two operators Q|^ and Qp 

E(qi) = (QlQf"'*’) ^•^QLQF'^F^T-t)m q [33] 



There are t-1 complete cycles of a reinforced presentation followed 
by m-1 other items, and then on trial t, there is a partial cycle 
consisting of a reinforced presentation followed by (m-l)/2 other items 
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The Items in block II then constitute a total of (T-t)m presenta* 
tions on each of which the forgetting operator is applied. Sub- 
stituting the first-order difference equation for the operators 
yields 

E(q]) = [1 + X(l-Z) - Y(l-aZ)]q 
where X *^a\t j-tm- iii-l 

VFj ^ “T [34] 

Y = + m-1 ■ 

Z = l-b^^-l 

Tlab^ 

From similar considerations it can be seen that 



E(q2) = 

= [1+x' (1-Z) - Y' 0-aZ)] [35] 

where 

x' = {ab“"'®)'^"‘*^'“'*b ^ 

Y' * b !5ll 
2 

The average error ratOt averaged over items in both blocks* is simply 

E(q) = i q [2 + (1-Z)(X + X') - (l-aZ)(Y + Y')] [36] 

Since only the quantities X» X^ and Y involve the parameter t* with 
respect to which a minimum is being sought, the optimization problem 
is reduced to finding a minimum in _t for the function 

F = (1 - Z)(X + X') - d-aZ)(Y+Y'). [37] 

The derivative is 

^ = (1-Z) ^ + d^ - (1-aZ) ^ 

dt dt dt dt [38] 
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where 



dx = bfm- Sljl/a't 1ogfa \ 

^ \^l U/ 

dX' = - (ab'"-1)'^'’^*^b ^ log (ab*"-^) [39] 

dt 

^ = . b(T-t)nH- log b 

dt 

dF 

When is set-to zero, there is again no apparent solution i*; 
for tne resulting equation. Following the strategy used for m * 1, 
assume that if (1) £ is not close to 1, (2) lb is reasonably large, 
and (3) T is not too small, then ^will be negligible, so that an 
approximate value of t^-j may be found by solving the following 
equation for t: ^ 

0 * -d-Z) ^ - (1 - aZ) [40] 

dt dt 



Making the appropriate substitutions for the derivatives, elimin- 
ating the common factor expressing log (ab^"!) as (log a 

+ (m-1) log b), and dividing through by log b yields the result, 

a t-(T-l) s (1-Z) f log a + m-fl . [41] 

b mbm (1-aZ) \ll®9 ^ 

Letting the quantity on the right side of the equation be ri-spresented 
by L and taking the logarithm, we have 

t - (T-l) log/ a "j = log C [42] 

or 



top ° ^°9 C 

^°H^) [43] 

If £<-b, which will be the case with data to be consider'd 
later, and Tf log C is positive, then the final term in the equation 
above will be negative. Hence, the optimal number of crials will 
be approximately a linear function of T, the total number of trials 
available for both blocks, such that one or more trials should be 
given to the second block, but the exact number is independent of 
the total available trials. 
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The adequacy of the approximation was determined by program- 
ming a computer to search for optimal values of for which the 
derivative of Eq. 37 Is equal to 0. The parameter space looked 
at Included values of a from .3 to .7 in Increments of .2, b from 
.90 to .999 In a roughly exponential series (more closely spaced 
as the boundary of 1.0 was approached), list lengths of 20 and 50, 
and total trials, T, from 2 to 25. The approximation Is quite 
good (the difference between actual and approximate top is .1 or 
less) except for a and ^ close to 1 (.995 or greater) ,^and for 
small values of m and T (lists of less than 10 items, or fewer 
than 10 trials). ““ 

Two properties of the parameters a and b are worth noting 
briefly. First, suppose a>b^. In this case, the proportionate 
reduction In error probability which results from a study presenta- 
tion 1s less than the reduction In success probability which occurs 
when another item Is presented. (Recall that the operator Qp is 
applied once for each Interpolated Item in a list.) From Suppes' 
(1964) results, when a list is divided Into blocks of size and 
the items In each block receive an equal number of trials, o|-uimal 
block size is 1. That Is, the first item should be presented T/m 
times, then the second item, etc. Because forgetting takes place 
at a faster rate than acquisition, perfect performance Is Impossible. 
The expected en?r rate, E(q), following a total of n_ trials rer 
item for each of m items, with a block size of 1^, was shown by 
Suppes (1964) to Fe 

E(q) = q - (1-b™)0-a) A-Y'’Xf^ h-X 'll 

^ (l-b)m V 1-YX / VTF// [44] 

where and X = b^. If and the optimal block size of 

1 is used, then Eq. 44”l)ecomes 

E{q) = q 5l - (1-b"'")(1-a")\ 

^ m (l-bii) 'J [45] 

As in becomes large, E(q) approaches a limit of q(l-l/m). The 
situation is perhaps more obvious if the guessing rate is close to 
zero. Then the prediction of the model is that the probability of 
a correct response after a large number of trials is 1/m; I.e., 
that only one item from the list will be learned. Performance 
generally reaches a higher level than acquisition of a single item 
in most situations, and hence It seems reasonable to suppose that 
generally a<b, at least within the context of this particular 
model. Crothers (personal communication) has found better retent'*on 
under part-list than whole-list presentation with very long lists 
of Russian vocabulary items, which implies a>b. However, it still 
seems doubtful that a block size of 1 would”be optimal. Rati.cr, o/.'j 
might question the adequacy of the model, which cannot predict that 
intermediate block sizes will be optimal. 
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Second, taking into account the magnitude of retention loss 
typically observed in paired-associate learning, it is likely that 
under most conditions b will be very close to 1. Suppose, for 
example, that a list of 10 items is learned to a rigorous criterion, 
so that it is reasonable to assume that the probability of a correct 
response is close to 1. Next the subject is given five trials on 
a second, 10-item list, for a total of 50 presentations, on each of 
which the probability of a correct recall of items in the first list 
is reduced exponentially with parameter b. If b is .999, the reten- 
tion loss is .05, while if b is as low as .99, Ihe loss is about 
.60; complete loss of the material in the first list is expected if 
b is less than .95. 

More generally the asymptotic probability of a correct response 
after many training trials using the whole-list method, given a<b, 
will be (c.f. Eq. 44) 

lim E(q) = 1 - (l-a)(l-b>^) 

n — ^ md-bjd-abtn-l”) [46] 

If ^ is not fairly close to unity, then even for relatively short 
lists (e.g. 10 to 20 items), the asymptotic error rate is mud. 
higher than is typically observed in paired-associate learning, in 
which "perfect" performance (i.e. a very low error rate) is readily 
reached by most subjects. Given that ^ is approximately 1 and m 
not too large, then (l-b*’^) = (l-b)m, and (l-abi^"l) = (1-a), so that 
the error rate will be close to zero. To suggest that b be very 
near unity means that the interference produced by a single inter- 
polated S-R presentation is slight; the retention loss over a 
series of interpolated pairs of even moderate length rapidly reaches 
sizable proportions, however. 

The fina'. theoretical question concerns the efficiency of 
split-list learning, given optimal allocation of trials to the 
first and second blocks, compared with whole-list presentation. 

Uhile it has not been possible to prove this result analytically, 
it appears that the whole-list procedure is always more efficient 
if The expected error rate, computed numerically over the 

:»ame range of parameter values used in evaluating the approximation 
to tpp, was always lower for the whole-list method than optimal 
split-list. Plots of the efficiency function, E(q) split "E(q) v^hole 
show that the exact value of the difference bears a complex * 
relation to the parameters, a and and the number of trials. 

However, for values of ^ between .3 and .7, the difference in 
error probability between the whole-list and split-list procedures 
is slight (generally less than .05). Empirical support for the 
theoretical prediction of a small loss in efficiency when using 
the split-list procedure, would be significant because often there 
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are practical reasons to present material in blocks rather than using 
the whole list. For example, the error probability during training 
typically remains quite high for a considerable number of trials 
with the whole-list method, which may cause the student to feel that 
he is making very little progress, with a resultant lowering mo* 
tivation (Hovland, 1951). Under the special circumstances of the 
experimental laboratory, this motivational decrement may often prove 
inconsequential, especially since experimental lists are typically 
short pess than 50 items). Outside the laboratory, and with fairly 
long lists, the problem is probably of more concern, and it is of 
importance to be able to specify the conditions under which the loss 
in efficiency is slight. 

The experiment described below w<iS carried out to obtain some 
preliminary data on the effects of variations in blocking procedures 
on efficiency of learning foreign language vocabulary associates. 

METHOD 

Subjects. — Ten graduate students in psychology at the Univer- 
sity of Wisconsin, Madison, served as Ss. None of the students had 
any previous experience vnth the German language. Each received 
ten dollars (about $1.00 per hour) for his services. 

Materials. — The vocabulary items consisted of five lists of 
50 German words. Each list was composed of approximately the same 
number of nouns, verbs, adjectives, and other parts of spveech. 

Obvious english cognates were avoided. Within-list similarity was 
kept low in the stimulus members by not permitting two or more 
words from the same root to occur in a single list, and in response 
terms by eliminating synonomous response terms within a list. Each 
German stimulus word was printed on a 1.5 x 3.5 in. card, and the 
appropriate english equivalent was printed on the reverse of w.e 
card. 



Procedure. — Five different presentation conditions were used 
in the study: 

W Whole-list; a list of 50 words was presented in random order 
for five trials. 

S-5/5 Split-list with a 5-5 division; the list of 50 words was 
divided into two 25 word blocks. The items in the first 
block were presented in random order for 5 trials, and 
then the items in the second block were presented for 5 
trials. 

S-7/3 Split list with a 7-3 division; the items in the first 
block were presented for 7 trials, and then the items 
in the second block were presented for 3 trials. 
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S-9/1 Split list with a 9-1 division. 



P A part- 11 t presentation; the 50 words were divided into 5 
blocks of 10 words, and each block was presented for 5 
trials. 

Subjects were run under a different condition-list combination 
for five consecutive days using a Graeco-Latin square design so that 
all lists and conditions were represented in a sequentially counter- 
balanced order over a set of five Ss. Two different squares were 
made up for the two sets of 5 Ss. At the conclusion of each training 
session, a recall test (T-1) on the material just learned was admin- 
istered. A second test (T-2) was given 24 hours later, just prior 
to the training trials on the next list. (The second test for the 
fifth day's material was also administered 24 hours after initial 
training.) A final test (T-F) on all 250 words from the five lists, 
randomly ordered, was performed from 3 to 5 days following the final 
training session. 

At the beginning of the first session, the subject was given the 
following Instructions: 

"In this experiment you are going to learn some german 
On each trial, I will show you a card which has a german word printed 
on it, and I will pronounce the word for you. You must either try to 
give the english equivalent, or say 'No answer'. After you nave mad« 
your response, I will tell you the correct english answer. You shoull 
try to take no longer than 10 seconds to make your response. You will 
be tested on the v/ords you learn several times, so you should try to 
retain them, but no discussions with other subjects, persons speaking 
German, or dictionary study are allowed." 

After answering any question by rephrasing the instructions, the 
first training session began. A standard anticipation procedure was 
used. On each presentation, the experimenter held up the card with 
the german word before the subject, and pronounced the word. (The 
experimenter had two years of undergraduate german.) After the sub- 
ject's response, the english equivalent was spoken by the experimen- 
ter. About 5 seconds later, the next stimulus was presented. The 
card decks were shuffled between trials. A Masonite divider sepa- 
rated the subject and the experimenter, so that neither the stimulus 
decks nor the data sheets were visible to the subject. 

Results 

In Table 10 is presented the mean error probability as a ^unction 
of condition, list and training day for each of the three tests. An- 
alysis of variance of these data indicated that the presentation 
technique was a significant variable during all three tests: 
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Table 10 



Observed error probabilities 
at each test interval and for each condition 
and predicted error probability 
during initial test, T-1, in parentheses 



Test 

Interval 


U 


S-5/5 


Condition 

S-7/3 


S-9/1 


P 


T-1 (10 min.) 


.190 


.178 


.192 


.380 


.212 




(.176)* 


(.193)A 


(.218) 


(.367) 


(.202)* 


T-2 (24 hr.) 


.228 


.242 


.294 


.459 


.314 


T-F (5 da.) 


.490 


.478 


.552 


.653 


.571 



* Used for parameter estimation. 
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T-1, F(4,28) « 17.9; T-2, F(4,28) « 16.1; T-F, F(4,28) « 6.2; 
p <.01 for each ratio, the day variable was significant durinn the 
final test only F(4,28) = 7.4, p <.01. No other sources of vaeianco 
were significant, thus, the lists were about equal in difficulty, 
and there was no evidence of "learning- to-learn" effects over days. 
The 9/1 split-list procedure was the least effective, followed by 
the part-list procedure. The whole-list and 5/5 split-list pro- 
cedures lead to similar performance on all tests, with somewhat 
more than half of the items being correctly recalled on the final 
test under these conditions. Error probability on the final test 
was a decreasing fup.ction of day number; i.e., the later in the 
training series a list was learned, the better the list was recalled 
on the final test. This result could be attributed to either greater 
delay between training and test, or to the fact that more lists were 
learned in the Interval, or both. 



In order to determine whether the data from the experiment may 
be described by the model, it is necessary to estimate the parameters 
a and This estimation was carried out by finding numerically 
those parameter values which minimized the squared deviations of pre- 
dicted and observed error probability on the first test using con- 
ditions W, P and S-5/5. The least squares estimates were a = .64 
and ^ = .999. The predicted values for each of the conditTons on 
the first test are presented in Table 1. According to model, the 
optimal division of the ten trials under the split-list condition 
should be 5.4 trials »or the first block. Thus the S-5/5 condition 
should be optimal, but a 7/3 division should be only slightly worse, 
according to the model. 



In addition to first test performance, it is possible to pre- 
dict performance during the training sessions. For example, let 
Tp(n.) be the expected total errors during the first n training 
trials of a list containing m items. It can be shown that 




where y * 



Observed total errors from the first five trials of the training 
series for condition P, the average of the three 3 conditions, and 
condition W provide information about learning under list lengths 
of 10, 25 and 50 items, respectively. (The first block data only 
were used from conditions P and S.) The observed total errors in 
five training trials were 2.41, 2.94 and 3.11, and the predicted 
errors v/ere 2.52, 2.58, and 2.67. The predictions are not bad, 
but leave something to be desired. (With ^ so close to 1, slight 
changes in this parameter cause big differences in the predicted 
values of the total errors. For example, if b is set equal to 
.996, the predictions are much closer to the observed values — 

2.62, 2.84, and 3.14. Test performance predictions become much 
worse with this estimate of b, however). 
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without additional assumptions, the model cannot generate 
predictions about performance on the second and final tests. It 
Is designed to account primarily for Interference-produced for- 
getting which may occur while a list Is being learned, and It Is 
well-known that considerable extra-experimental forgetting occurs. 

It Is important to note, however, that the relative ordering of 
performance under the different experimental conditions is the 
same for the long-term tests as well as the 1mmedic*.te test. The 
effects of the presentation procedures did not wash out over a 
period of about a week, and in this sense can be called long-term 
effects . 

In summary, the two-operator model handled several important 
aspects of the data very creditably. Given an estimate of the learn- 
ing and forgetting rate parameters for a subject matter and for a 
particular student population. It Is possible to predict perfor- 
mance with longer lists and the additional training which may be 
required to reach any criterion level. It appears that when It 
is convenient, long lists may be divided Into two sublists, and 
with optimal assignment by presentation time to each of the sublists, 
this split-list procedure will be very nearly as efficient as the 
whole-list technique. The model is inadequate in several respects, 
as mentioned previously. Its chief virtue is simplicity, arc’ 
so, problems of mathematical analysis arise. The theoretical arid 
empirical work described above nevertheless provide a stepping 
stone to further progress in the study of efficiency in learning. 
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Section IV. Acoustic and visual confusions in immediate memory 

Recent investigations of human memory suggest that the stim- 
ulus input may undergo several transformations over a period of 
perhaps less than a minute or so (Sperling, 1966; Wicklegren, 1966; 
Dale & Gregory, 1966). Restricting attention to visually-presented 
verbal stimuli, the sequence of information processing may take the 
following form: a brief visual storage period of no more thai* a 

few seconds, an auditory or acoustically-encoded period of around 
20 to 30 seconds (as if the subject had spoken or repeated the 
material to himself), and finally an associatively-encoded repre- 
sentation is achieved, which may last over in indefinite time in- 
terval. This representation of the functioning of human memory 
rests largely on the patterns of confusion errors in memory with 
variation in presentation and retention intervals. The primary 
study reported in this section was designed to shed further light 
on the effects on short-term recall of the relation between the 
mode of input--visual or acoustic — and the mode of output or testing, 
which used analogs of the visual and acoustic input procedures. 

(This study was conceived independently by William Chase, and he 
was largely responsible for its design and execution.) A second 
study was performed to obtain a visual confusion matrix based on 
perceptual errors. 



Method 

Subjects . --The Ss were 11 female introductory psychology stu- 
dents who received cTass credit for participation in the experiment, 
and one female graduate student who was paid for her participation. 

Design .— The experiment consisted of visual or acoustic p’-esen- 
tation of the list for memorization, visual or acoustic presentation 
of the test stimulus, and 3 types of material (visually confusing, 
acoustically confusing, or neutral letters), comprising a 3x2x2 
factorial within-Ss design. Each participated in five 45-minute 
sessions. On days 1 and 5 the Ss performed under all 4 combihv- 
tions of visual and acoustic presentation of the list with visual 
and acoustic presentation of the test, using as stimulus materials 
the digits 1-8. The first session provided the subject with some 
experience in the task; the last session permitted an evaluation of 
the magnitude of improvement over sessions. On days 2, 3, and 4 
the ^s received 4 tasks a day, and over the 3 days, received all 
combinations of 4 tasks and 3 types of material. The order of the 
12 conditions was counterbalanced by means of a 12x12 latin square 
with one S per rov/ of 12 conditions. 
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Materials and prociedure . — The stimulus materials consisted of 
lists of visually confusing letters (BCDGOQRU, cf.. Chase, 1965), 
acoustically confusing letters (BCDEPTVZ, cf. , Conrad, 1964), or 
neutral letters (ADHIMQYZ). With each type of material, lists 
of 1 , 2, or 4 letters \iere composed. Sixteen lists were made up 
at each length, with each letter being used an equal number of 
times. There were 8 lists with the test letter present and 8 lists 
with the test letter absent. Each letter within a type of material 
was used once as a test letter, and each serial position contained 
an equal number of positive test letters. The set of 48 lists, 16 
at each of 3 list lengths, was then randomly permuted, so that list 
length and type of material varied randomly within each task condi- 
tion. The same random sequence was used for all 4 conditions within 
a day. Three practice lists, one of each list length, preceded each 
set of 48 experimental lists. Prior to each condition within a day, 
vjas informed of the new task conditions. 

Visual stimuli v/ere typed on plain white index cards in capital 
elite lettev's. Memory lists were presented on 3x5 cards for a 5 
sec. interval. Test letters vjere on 5x7 cards centered in thi 
exposure field of a Polymetric two-channel tachistoscope. The 
letters subtended 33' of visual angle when viewed in the tachisto- 
scope. 

Acoustic stimuli were recorded in a female voice on a Wollen.aK 
stereophonic tape recorder. The memory list was presented twice 
at a rate of .5 sec. per letter, with the second presentation 
starting 2.5 sec. after the first. 

A warning click sounded 5 sec. after the start of the presenta- 
tion of the memory list, and tnen 2 sec. later, the test letter was 
presented, either visually or acoustically. The S's task was to 
press one of two buttons marked "YES'* or "NO", depending upon wheli'.er 
or not the test letter was a member of the previously memorized list 
of 1, 2 or 4 items. Half the Ss were assigned at random to press 
"YES" with their dominant hand and half with their non-dominant hand. 
The Ss were instructed to respond as quickly as possible without 
making any errors. They were also informed that there were an equal 
number of "YES" and "NO" responses arranged In a random sequence. 

The latency of each response was measured on a Standard Electric 
timer to the nearest .01 sec. 

Resul ts 

The median reaction time for correct responses was determined 
for each task by type-of -material by list -length combination for a 
S^, based on the pooled reaction times from 8 "YES" and 8 "NO" 
responses. A least squares estimate of the slope of the react 3 n 
time function over list lengths was then computed for each subject 
and for each of the 12 basic conditions. This slope measure, which 
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represents rate of search through the list in memory, is the main 
dependent variable. An analysis of variance showed that the only 
significant main effect was due to type of material, F(2,22) = 
4.68, p<.05. Rate of search through acoustically confusing lists 
was much slower (64.0 msec. /item) than through the visually con- 
fusing and neutral lists, which did not differ from one another 
(46.1 and 48.0 msec. /item, respectively). 

There was a significant interaction between mode of list 
presentation and mode of test presentation, F(l,ll) » 13.05, 
p 4.005. As can be seen in Table 11, search rates are much 
higher if the presentation and test is in the same mode (46.7 
ms/item) than if they are in different modes (58.8 ms/item). 

The intercepts of the reaction time functions were also 
analyzed. The only significant effect was mode of test presenta- 
tion; reaction time was faster to an auditory test stimulus than 
a visual test stimulus by 117 ms., F(l,ll) = 40.8, p<.001. 



The most significant results of this study were (1) the large 
effect of acoustically confusing material, (2) the absence of any 
effect due to visually confusing material and (3) the faster pro- 
cessing rates when identical Input and output modes are used. The 
first result is not surprising in view of the studies 
of Conrad and Wicklegren. For the time intervals used, there is 
good reason to believe that the primary representation of verbal 
material in short-term memory is of an acoustic or phonetic form. 

The second finding is somewhat surprising in light of an earlier 
study by Chase (1965) using identical materials and a very similar 
memory search task, in which search rates through the visually con- 
fusing material were slower than through either acoustically con- 
fusing or neutral material. We have not come up with a very satis- 
factory explanation for these contradictory results, but some prelim- 
inary ideas are currently being investigated. 

The last finding mentioned above has important bearing or 
theories of memory. The ubiquitous finding of acoustic confusions 
in memory would seem to imply a common format and storage location 
for verbal material after the first few seconds of processing have 
taken place. Since information is more readily available for test 
vjhen similar sensory modes are used for input and output, then either 
the "acoustic" memory tr ce might retain some type of sensory input 
tag which facilitates retrieval differentially depending on the out- 
put mode, or there might be different temporary storage locations 
for material input in visual or auditory modes, even though the form 
of the stored material in both locations might be acoustic. In any 
event, the mode of sensory input continues to be an important factor 
in retrieval, even after processing to a common format has been achieved. 



Discussion 
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Table 11 



Search rates in msec, per item as a function of 
modality of presentation of list and test 

Test Modality 





Visual 


Acoustic 




Visual 

List modality 


44.1 


57.4 


50.8 


Acoustic 


60,1 


49,2 


54.6 


1 


52.1 


53.3 


52.7 
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While selecting materials for the preceding study, a * view 
of the literature revealed that there existed no recent evatuation 
of visual confusions among letters of the alphabet. Consequently, 
the following study was carried out to obtain a complete visual 
confusion matrix. This matrix v/as to be compared with a visual 
overlap measure devised by Chase (1965) as a basis for selection 
of a visually confusable set for his study. 

Method 

Subjects .— The Ss were 77 introductory psychology students run 
in 9 groups of from 7 to 10 Ss per group. The records of 5 additional 
Ss were rejected because they did not have data for every tri< t. 

S timuli . — Block capital letters were photographed to produce 
a clear black image on a white background. The masking stimulus 
was a random black and white checkerboard pattern with an equrl 
number of black and white squares. 

Apparatus .— The stimuli were projected from behind onto an 
opaque glass screen 3" high by 10" w4<ie situated in the center of 
a 5‘ by 3' masonite panel. The S- were seated 7-10 feet from the 
screen. 

Two Anscomatic II slide projectors were used to present the 
letters and the masking stimulus. Two Wollensak alphax camera 
shutters were used to time the stimulus presentations of the letters 
and blanking stimulus. The intertrial interval was timed with a 
cam-timer controlling, a Cramer model 940A timer which controlled 
onset of the letter stimuli and the masking stimulus, respectively. 

Procedure . — The sequencing of events during a trial was as 
follows. The trials were spaced 5 seconds apart. A brief apparatus 
click served as a warning signal, followed in 1/2 sec. by the letter 
and masking stimulus. The letter was exposed for about 20 msec., 
followed by about 30 msec, of darkness, and then followed by a 
masking stimulus for 1 1/2 seconds. The masking sitmulus was used 
to control precisely the duration of the visual afterimage. 

A preliminary study was conducted in which the time between 
letter onset and masking stimulus onset was varied in order t^ de- 
termine an exposure duration which would yield about 50% error . 

The duration chosen was approximately 50 msec. 

Two hundred and sixty slides, 10 copies of each letter, w 
arranged in random order, 40 slides in each of 6 trays, and a 
final tray with 20 slides. The order of presentation of the 7 
trays of letters was counterbalanced across the 9 groups so that 
each tray appeared at least once and not more than twice in any 
serial position. 
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The answer sheets consisted of 14 columns of 29 boxes on a 
single sheet. Every other column was separated by a double line 
so as to correspond to a block of 40 trials. The Ss were instructed 
to make a response in every box, even if they had to guess. They 
were first shown a few exposures to acquaint them with the apparatus. 
The duration was gradually decreased to the exposure duration used 
in the experiment so the Ss could first get an idea of the letters 
used and the exposure duration used. They were then run on 40 
trials at a pace of 5" per exposure while the experimenter called 
off the number of the trial in advance. A short rest was given 
after each block of 40 trials while the experimenter changed slide 
trays . 



Results and Discussion 

The stimulus-response Confusion matrix is presented in Table 
12. The error rate varied from 60 to 90 percent for various items. 
Substantial response bias was observed, some letters occurring as 
responses with twice the frequency of others (e.g. Z vs L). “":.e 
pattern of confusions is generally along the lines to be expected 
from structural considerations, with some exceptions. For example, 
the following sets are mutually ccnfusable: C-G, E-F, I-J-L, 0-Q, 

K-X, and V-Y. Hov/ever, the following sets are not confused, though 
it would have been, predicted from a visual overlap measure (Uiusa, 
1965): A-H, C-D, U-V, etc. The confusion matrix is being treated 

by the multi -dimension monmetric scaling procedure of Shephard (1954) 
in an effort to find a space of fewer than 26 dimensions to describe 
the matrix. Tentative results lead to the conclusions that (1) a 
space of about 6 dimensions is required to describe the data, and 
(2) the dimensions represent specific complex combinations of the 
sort represented by E-F vs I-J-L, and C-G versus 0-Q, rather than 
more general characteristics such as vertical and horizontal 11."'s, 
curved segments, etc. Additional work remains to be done in analyzing 
these data. 
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Section V: Short- tenn memory in children 

The studies reported in this section investigated short-term 
retention in children as a function of the amount to be retained* 
and information prior to presentation about the amount to be retained. 
The initial study will be described only briefly, since it is already 
available in a published report (Calfee, Hetherington a Walzer, 1965). 
The study was designed to investigate the effects of v^i thin-subject 
variation in list length on STM of pre-school children* 



Method 

The Ss were 38 preschool shildren between 3,5 and 5 yearr of 
age (X=4.T, S.D.*.51) from the Unitarian Society Nursery Schoc \ 
Madison, Wisconsin. Each child was asked if he would like to play 
a game; if the child was willing, he was brought to the experimental 
room. There he was shown a collection of small toys, and told that 

he might choose one toy as a prize for playing the well. All 

Ss received the toy of their choice at the end of the session. 

Three ^s who asked to be run again were given a second session 
a week or more after the first session. There were no noticeable 
differences between sessions, and these data are included in the 
analyses to follow. 

The stimulus materials consisted of a set of eleven brightly 
colored animal cards. Each card was shown to S^, who was asked to 
name the animal. If there was no response, was told the answer, 
and the process was repeated until S was familiar with each card in 

the deck. The experiment consisted of 24 trials*^jeacb triaV re- 

quiring about 1 minujtei_ for a total sesslorPtime of about half-an- 

^hournr 



On each trial a subset of 3, 4 or 5 cards was randomly selected 
for presentation. The randomization for a child was arranged so 
that each display size v;as used for 8 trials, and each serial position 
was tested at least once for every display size. The selected cards 
were shown one at a time to S for a 1 sec. interval, S called out the 
name of the animal, and then the card was placed face-down in a hori- 
zontal array in front of S_. After the last card was presented, a 
cue card which was identical to one member of the presentation set 
was held up, and S was asked to turn up the matching card in ..le 
array. If the response was incorrect, continued to turn up addi- 
tional cards until a match was obtained. The intertrial interval 
was about 40 sec., during which time E arranged cards for the next 
trial and chatted with S. 
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Results and Discussion 
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In the bottom marginals of Table 13 is presented the mean pro- 
portion of correct responses at each position for the three display 
sizes. Position 1 corresponds to the last card displayed prior to 
the retention test. The proportion of correct responses is a de- 
creasing function of the number of cards intervening between the 
test position and recall. The probability that the last card in 
the display is correctly identified when it is the test posit jn 
varies between .85 and .90, and appears to be unrelated to display 
size. It may be that this performance represents the best obtain- 
able with preschool children in this task. The proportion of correct 
responses in Position 2, the next-to-last card presented, var'^’es frcm 
.34 to .67, and is monotonically related to display size. As tne 
display size is decreased (i.e., as fewer cards are presented), S_ 
is more likely to recall the card presented in Position 2, and the 

• amount of this improvement is greater than can be accounted for by 
changes in the guessing rate. 

The number of times each position was the first choice at each 
test position is presented in the cells of Table 13 for the three 
display sizes. Investigation of these data indicates no generaliza- 
tion around the correct position was observed. (Such generalization 
was observed in a study by Atkinson, Hansen, and Bernbach, 1364.) 
Rather, there was a tendency at all display sizes to choose one of 
the middle cards In the array when an error occurred. This tendency 
is apparent in the distribution of error probabilities in the right- 
hand margin of Table 13. 

Sittce-there appeared to be no generalization with the relatively 

small display sizes used in this study, the hypothesis was entertained 
that, when the cue card was presented, either the child was able to 
retrieve the position of the matching card from a short-term store, 
or else the child simply guessed at random according to the non- 
uniform distribution in Table 13. The following analysis was carried 
out with this hypothesis in mind. Each child was allowed to turn 
up cards until the matching card appeared. In Table 14 is pr ientei 
the mean number of cards turned up at each test position, based on 
those trials when the first response was incorrect. The predicted 
values are obtained from the all-or-none retrieval hypothesis men- 
tioned above, and the empirical error distributions in Table 
(Independence of successive choices from irrelevant alternatives 
was also assumed, so that following the choice of a card, the error 
probabilities for the next choice were obtained by eliminating that 
choice and renormalizing the empirical frequencies. For example, if 
the initial choice of Position 1 was incorrect on a trial, the theo- 
retical probability of choosing Position 2 next would be .74 for dis- 
play size 3.) The predicted values in Table 14, in fact, give a very 
good account of the observed data. If there is any noticeable trend 
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Table 13 

Frequency of first response i i each position for 
each test position and error frequency distributions 

for three display sizes. 



First Test Position Total P (E) 

Response 12 3 45 Errors 







3 - 


Item 


List 








1 


93 


12 


10 


mm 


- 


22 


.18 


2 


12 


72 


61 


- 


mm 


73 


.60 


3 


4 


22 


42 


mm 


- 


26 


.22 


P(E) 


.85 


.68 


.38 














4 - 


Item 


List 








1 


71 


14 


4 


4 


- 


22 


.15 


2 


7 


40 


32 


16 


- 


45 


.31 


3 


2 


16 


35 


33 


CM 


51 


.36 


4 


3 


11 


n 


29 


- 


25 


.18 


P(E) 


.85 


.49 


.43 


.35 




- 








5 - 


Item 


List 








1 


62 


9 


8 


5 


1 


23 


.14 


2 


3 


23 


15 


14 


6 


38 


.20 


3 


1 


25 


25 


23 


26 


101 


.40 


4 


1 


8 


12 


13 


8 


42 


.16 


5 


0 


2 


6 


10 


22 


18 


.10 


P(E) 


.92 


.3^! 


.38 


.20 


.35 
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to the discrepancies, the predicted values tend to be slightly 
higher thah the observed, which might indicate that the subject 
may occasionally have some idea about the correct position, even 
though the first response is wrong. This Interpretation would 
accord with the finding of both Atkinson, et al, (1964) and Han- 
sen (1965) that the second choice, following an initial error, 
tends to be correct more frequently than chance would predict. 

The second study in this series was originally designed as 
an extension of the first study, using a wider range of list 1 -'‘•hs, 
varied again vyithin subjects. In the first study, the variation in 
size of the list was quite small in absolute terms (3 to 5 items), 
while in the present study, each child was shown lists of 4, 8 or 
n i terns. A substantial decrement in performance occurred even with 
short list lengths for the wider-range condition, and the behavior 
of the children tended to follow inefficient stereotypes, such as 
looking for the test card by starting at one end of the list and 
working toward the other end. The thought occurred to us that, with 
the wider range of list length variation. Information about the num- 
ber of cards to be shown prior to each list presentation might serve 
an important role both ..i re-establishing a stronger recency effect, 
and also by permitting other, "memory allocation", mechanisms to 
come into play. 



Method 

The subjects in Group MK (No Knowledge) were 16 children (mean 
age, 4.6 years, range, 4. 2-4. 9) from the Preschool Laboratory, Uni- 
versity of Wisconsin, Madison who were tested during the fall of 
1965 by Pat Gottleib. Each child participated in two testing ses- 
sions, with an interval of from 6 to 8 weeks between sessions. In 
the first session, a presentation interval of 2 sec. per card was 
used; in the second session, the presentation interval V’/as 4 sc*c. 
per card. There was no significant difference between session , so 
the data were collapsed over sessions for analysis. 

The subjects in Group K (Knowledge) were 14 children from the 
Neighborhood House School, Madison (mean age, 4.5 years, rang^ 3.5- 
5.0) , who vyere run in the spring of 1966 by Rogina Polesta. Each 
child participated in a single session, in which a 2 sec, presenta- 
tion interval was used. 

Each child was asked if he would like to play a game; if the 
child was willing, he was brought to the experimental room, .There 
he was shown a selection of toys in a box and told that he could 
choose any toy as a prize for trying hard and playing well. All 
Ss received the toy they desired at the end of the session. 
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The stimulus materials consisted of the same set of eleven 
animal cards used In the first study. Each card was shown to the 
who was asked to name the animal. If the did not know the name, 
he was told the ansv^/er, and the process was repeated until knew 
the names of all the cards in the deck. The experiment consisted 
of two sessions of 23 trials each. Each trial with a present ‘"‘on 
interval of two sec. lasted about a minute, for a total session 
time of about 30 minutes; each trial with a presentation interval 
of four sec. lasted about minute and a half, for a total session 
time of about 40 minutes. 

On each trial a subset of 4, 8, or 11 cards was randomly 
selected for presentation. The randomization for a S was arranged 
so that for each subset, every serial position was tested once oer 
session. The selected cards were shown one at a time to S for a 
two sec. interval In the first session and a four sec. Interval In 
the second session. S called out the name of the animal, and the 
card was placed facedown In a horizontal row In front of S. After 
the last card In the subset was presented, a cue card which was 
Identical to one card of the presentation was held up, and S 
was asked to turn up the card which he thought matched the cue card. 
If his response was incorrect, S was allowed to continue turning 
up cards until the correct card' was obtained. The intertrial inter- 
val was about 15 sec. duringr which time E picked up the completed 
trial and took out the cards previously arranged for the next trial. 



In Group K, the following modification to the basic procedure 
was introduced to give the child Information about the number of 
cav-ds to be presented prior to each list. The cards were laid out 
during the presetnation on a 30x8 In. board, which was marked >ff 
Into 11 sections by red stripes. Prior to each list, the experi- 
menter covered all positions not to be used in the next list with 
a masonite cover, so that prior to a 4-1 tern list, only 4 sections 
remained uncovered, etc. The purpose of the board was explai. : to 
the child following the familiarization training. 

Results and Discussion 

In Table 15, the probability of a correct response at each 
serial test position is presented for each group and the three list 
lengths. Of the 23 entries in Table 15, there are only two for 
which the performance of Group HK is superior to Group K. The 
overall mean probability of a correct response shows the performance 
of the children with prior knowledge of the number of cards to be 
clearly superior to the no-knowledge group (.37 versus .24). The 
largest differences occur when the most recently presented cards 
are tested, but substantial performance gains also are observed for 
the first card presented as well as intermediate positions. 
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Table 15 



Probability of correct response at each serial 
position as a function of list length with 
knowledge (K) and no knowledge (NK) of number 
of items to be stored away 



List 

Length 



4 



8 



11 



Position 





1 




2 




3 




4 










K 


1.00 




.64 




.21 




.29 










NK 


.72 




.32 




.19 




.26 












1 


2 


3 


4 


5 


6 


7 


8 








K 


.93 


.50 


.29 


.21 


.21 


.07 


.29 


.36 








NK 


.71 


.46 


.35 


.13 


.09 


.00 


.10 


o 

CM 

• 










1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


K 


^ .93 


.57 


.43 


.36 


.29 


.29 


.07 


.07 


.07 


.07 


.36 


NK 


.85 


.26 


.10 


.10 


.23 


.23 


.10 


.07 


.00 


o 

o 

• 


.19 
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It Is true, of course, that other variables are confound'* i 
with the knowledge-no knowledge dimension — viz., subject pO|.j- 
lation, time of year, and experimenter. It is our opinion that these 
Other variables do not contribute significantly to the observed 
differences in behavior. Not only is there a large performance 
difference; the pattern of responding is quite different in GivLip K. 
For example, very little choice stereotypy is observed in Group k. 
Children in this group tend to select cards in the vicinity of the 
correct test card; i.e., the response generalization observed by 
Atkinson, Hansen and Bernbach, 1965, is also observed in Group K. 

The markedly smaller amount of generalization in Group NK is 
apparent by inspection of the test-response matrices in Table 16. 

A more concise comparison of the relative amounts of generalization 
is obtained by comparing average absolute deviations of first posi- 
tion chosen from correct position, shown in Table 17. When the 
children in Group NK make an error at any position, they tend to 
make a much wilder guess than children in Group K. 



The improved performance of the children in Group K might be 
attributed in part to a Von Restorf effect. With prior knowledge 
about list length, the last card or two might have special saliency. 
From a limited -capacity hypothesis of short-term memory, the reduced 
retention at the most recent positions should be accompanied by 
better recall at some earlier positions. In fact. Group K subjects 
exhibited better retention at all positions, including the first 
card presented. 

While additional research needs to be performed to rule out 
alternative explanations, an interesting possibility raised b • 
these results concerns the utilization of short-term memory capa- 
city by a subject. Most theories of short-term memory have assumed, 
either implicitly or explicitly that the primary memory system 
consists of a fixed capacity buffer in which incoming verbal mater- 
ial is stored in a more or less serial fashion (Atkinson & Shrnrin, 
1965; Bower, 1966; Waugh & Norman, 1965). The data from the present 
study suggest that the subject may allocate space in the short-term 
memory system in a more dynamic fashion. Prior information about 
memory requirements may be useful in setting up the memory system 
so that more efficient storage becomes possible. Hypotheses about 
the mechanisms responsible for the increase in efficiency must neces- 
sarily be speculative at present. One possibility is that the sub- 
ject attempts to organize incoming material on the basis of first 
item (or two), last item or two, and middle items, where organiza- 
tion entails storage in separate “memory bins". When a test card 
IS presented, the subject proceeds in a hierarchical fashion, 
asking first whether the test card is first, last or middle, then 
tests within aeneral position. Free recall studies of subjective 
organization (e.g., Tulving, 1966) indicate that encoding strategies 
similar to that just described play an important role in retention 
of verbal material. In the absence of information about list length, 
the subject would not be able to carry out this operation. 
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Table 16A 



Frequency of first response at each test position 











for Group K 












First 










Test Position 










Response 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 












4-Item List 










1 


22 


11 


9 


9 
















2 


8 


10 


9 


9 
















3 


0 


3 


6 


5 
















4 


1 


7 


7 


8 


























8-Item List 










1 


22 


5 


6 


5 


8 


12 


8 


9 








2 


0 


14 


6 


8 


6 


4 


3 


1 








3 


4 


5 


11 


7 


4 


4 


6 


3 








4 


1 


3 


2 


4 


5 


4 


6 


4 








5 


0 


0 


3 


1 


3 


4 


4 


6 








6 


0 


0 


0 


1 


2 


0 


0 


0 








7 


0 


0 


0 


2 


1 


1 


3 


2 








8 


4 


3 


3 


3 


2 


2 


1 


6 


















11 -Item List 










1 


26 


1 


7 


3 


5 


3 


7 


3 


7 


4 


6 


2 


2 


4 


3 


10 


3 


2 


2 


3 


4 


3 


2 


3 


1 


4 


3 


2 


1 


2 


2 


2 


1 


1 


1 


4 


0 


2 


3 


3 


3 


7 


2 


4 


3 


2 


3 


5 


1 


4 


9 


3 


7 


1 


7 


6 


2 


5 


4 


6 


0 


6 


1 


4 


3 


7 


4 


5 


3 


1 


4 


7 


0 


1 


3 


1 


3 


3 


4 


3 


3 


6 


2 


8 


0 


2 


2 


1 


2 


2 


0 


2 


2 


2 


1 


9 


0 


0 


0 


0 


0 


2 


2 


1 


0 


2 


0 


10 


0 


0 


0 


0 


0 


0 


0 


0 


0 


0 


2 


11 


1 


3 


2 


4 


4 


2 


3 


2 


6 


5 


6 
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Table 16B 

Frequency of first response at each test position 

for Group NK 

First Test Position 



Response 1234 



1 14 0 1 0 

2 0 9 8 1 

3 0 4 3 9 

4 0 19 4 



1 13 2 0 0 

2 17 4 5 

3 0 14 6 

4 0 2 5 3 

5 0 110 

6 0 0 0 0 

7 0 0 0 0 

8 0 10 0 



1 13 1 1 0 

2 0 8 4 4 

3 0 16 2 

4 12 2 5 

5 0 10 1 

6 0 0 1 1 

7 0 0 0 1 

8 0 0 0 0 

9 0 0 0 0 

10 0 0 0 0 

11 0 10 0 



5 6 7 8 9 10 11 

4-Item List 



8-Itera List 

0 0 0 0 

2 3 0 3 

13 2 1 

6 4 2 0 

3 2 2 1 

113 2 

0 14 2 

1.0 1 5 

11-Item List 

0 0 0 0 0 0 0 

2 110 2 10 

2 10 10 11 

4 1 2 4 0 1 2 

4 3 5 2 0 1 2 

0 4 3 3 1 1 1 

0 3 1 1 5 4 0 

2 0 1 1 3 0 2 

0 0 10 12 0 

0 0 0 0 0 1 1 

0 1 0 2 2 2 5 



« 
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Table 17 

Average absolute deviation between correct 
test position and first response 



List 

Length 



8 



11 



Position 





1 




2 


3 




4 




K 


0 




1.2 


1.1 




1.1 




NK 


1.2 




1.3 


1.4 




2.2 






1 


2 


3 


4 5 


6 


7 


8 


K 


1.0 


2.3 


1.1 


1.5 1.6 


2.5 


2.2 


3.5 


NK 


2.2 


1.2 


2.1 


2.0 2.6 


3.3 


4.1 


4.8 




1 


2 


3 


4 5 


6 


7 


8 



10 11 



K 
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Section VI. General Conclusions and Implications 

The data from all the studies in this series support the :"en" 
eral conclusion that retention of single stimulus -response un*. s 
Is determined not only by the number of times such a unit has been 
presented for study. The context established by other, concurrent 
events— number and spacing of other items to be learned, the state 
of learning of those items, compatibility of input and output .r.j’es, 
prior information about memory load— is a significant factor deter- 
mining the relative efficiency of a reinforcement or feedback inter- 
val. A second conclusion concerns the adequacy and usefulness of 
stochastic models for verbal learning in handling data from more 
complex situations, and in directing us toward optimal strategies 
for presentation of verbal materials. The theoretical models dis- 
cussed provide a good first-order account of the data from several 
experiments reported in this paper. In several instances, alter- 
native models can be formulated, based on quite different represen- 
tations of the underlying psychological processes, yet it is not 
possible to choose between these alternatives. This is neither 
surprising nor disturbing. The choice of a particular theoretical 
model “depends upon its usefulness in a variety of situations, direct 
tests or* basic assumptions, and comparison with reasonable alter- 
natives. 

None of the models proposed was entirely satisfactory in ac- 
counting for all the data reported. In particular, various non- 
parametric predictions, such as constancy of the total errors or 
errors on first presentation in Experiment I, provide strong evidence 
that alterations will be required. At the same time, the analyses 
of the data suggested by the models provide hints as to what form 
such alterations should take. 

The practical usefulness of theoretical models in providing 
solutions to educational problems remains to be seen. For one thing, 
as soon as models of moderate complexity are used as the basis for 
optimization work, the task of theoretical analysis becomes to.. j id- 
able. Second, the variables which have been considered so far, such 
as block size and spacing, do not appear to be especially potent. 
While it is true that very inefficient procedures can be found (e.g., 
breaking a list into parts that are too small, repeating stimulus- 
response pair several times in succession), there apparently exists 
a fairly large set of presentation procedures of more or less equi- 
valent efficiency (e.g., the various spacing procedures in Exp. II, 
and Conditions W, S-5/5 and S-7/3 In the german-english experiment 
reported in Section III.) Further advances in optimization will de- 
pend on development of more adequate models, and extension of the 
research into areas marked by more potent variables. A prime candi- 
date for such an extension would appear to be the use of encoding 
strategies involving organization and structuring of incoming infor- 
mation. 
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Conclusions of a nore specific sort have been presented under 
the Discussion heading .n each of the preceding sections, and will 
not be repeated here, since they are best considered in conju... - .Jn 
with the details of method and analysis of the data. A 
implications of potential significance to educational practice mi glu 
be mentioned. First, not every reinforcement or feedback is effec- 
tive in a learning situation. When material is presented for study, 
the student may essentially disregard the information, and so waste 
the time. The mechanisms underlying the effective use of study in- 
tervals is not known, but certain conditions producing ineffective 
learning can be specified. Immediate repetition of the same infor- 
mation is extremely wasteful. When a list of items is to be PJ^“ 
sented several times, successive repetitions of a given item snouio 
be spaced out, for example, by using a whole-list rather than a 
part-list procedure. However, the major increase efficiency occurs 
when immediate repetitions are avoided. The relative effectiveness 
of spaced presentations may arise in part because of 
likelihood that errors will occur. An interesting question which 
is raised in a new way by these studies, but not answered, concerns 
the role of errors in acquisition of lists of verbal Information. 

A second implication concerns short-term memory processing by 
young children (and perhaps by older children and adults as well). 
There is a significant improvement in ability to retain and util- 
ize a list of items if the student has some idea about how long the 
list is going to be prior to presentation of the list. If it 
the case that there are wide variations in the amount of info, na- 
tion in successive lists, then "memory load" instructions can sub- 
stantially increase retention. 

Finally, although it appears that the human information 
cessing system may convert verbal information into a common (acous- 
tic) format at some level, input modality still remains a significant 
variable. If a word is presented visually, and must be remembered 
for a brief period of time, after which a test item is to be com- 
pared with the memory item, the comparison is more easily accom- 
plished if the test item is also presented visually. Crop-modal- 
ity memory comparisons are more difficult to carry out. Such com- 
parisons play an important role in many skill areap such as reapno. 
If it is desirable to simplify a task where the primary prepntation 
mode is, of necessity, visual, and the test mode must be auditory, 
one possibility is to enrich the presentation mode by an accompanying 
auditory presentation. 
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